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Abstract. Two concepts, evidently very different in nature, have proved to be useful in analytical 
and numerical studies of spectral stability in nonlinear wave theory: (i) the Krcin signature of an 
eigenvalue, a quantity usually defined in terms of the relative orientation of certain subspaccs that is 
capable of detecting the structural instability of imaginary eigenvalues and hence their potential for 
moving into the right half-plane leading to dynamical instability under perturbation of the system, 
and (ii) the Evans function, an analytic function detecting the location of eigenvalues. One might 
expect these two concepts to be related, but unfortunately examples demonstrate that there is no 
way in general to deduce the Krein signature of an eigenvalue from the Evans function, for example 
by studying derivatives of the latter. 

The purpose of this paper is to recall and popularize a simple graphical interpretation of the Krein 
signature well-known in the spectral theory of polynomial operator pencils. Once established, this 
interpretation avoids altogether the need to view the Krein signature in terms of root subspaces and 
their relation to indefinite quadratic forms. To demonstrate the utility of this graphical interpretation 
of the Krein signature, we use it to define a simple generalization of the Evans function — the 
Evans-Krein function — that allows the calculation of Krcin signatures in a way that is easy to 
incorporate into existing Evans function evaluation codes at virtually no additional computational 
cost. The graphical interpretation of the Krcin signature also enables us to give elegant proofs of index 
theorems for linearized Hamiltonians in the finite dimensional setting: a general result implying as 
a corollary the generalized Vakhitov-Kolokolov criterion (or Grillakis-Shatah-Strauss criterion) and 
a count of real eigenvalues for linearized Hamiltonian systems in canonical form. These applications 
demonstrate how the simple graphical nature of the Krein signature may be easily exploited. 
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1. Introduction. This paper concerns relations among several concepts that 
are commonly regarded as useful in the analysis of (generalized) eigenvalue problems 
such as those that occur in the stability theory of nonlinear waves: Krein signatures 
of eigenvalues, Evans functions, and index theorems (also known as inertia laws or 
eigenvalue counts) governing the spectra of pairs of related operators. The most 
elementary setting in which many of these notions appear is the stability analysis of 
equilibria for finite-dimensional Hamiltonian systems, and we take this opportunity 
right at the beginning of the paper to introduce the key ideas in this simple setting, 
including a beautiful and simple graphical method of analysis. 

1.1. A graphical method for linearized Hamiltonians. The most impor- 
tant features of the linearized dynamics of a finite dimensional Hamiltonian system 
close to a critical point are determined by the values of v in the spectrum a( JL) of 
the (nonselfadjoint) problem 

JLu = vu, L* :=Zt = L, J* = -J. (1.1) 
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Here J is an invertible skew-Hermitian matrix and L a Hermitian matrix of the same 
dimension (both over the complex numbers C)Q The conditions on J require that the 
dimension of the space be even and hence L and J have dimension 2n x In. Indeed, 
a Hamiltonian system linearized about an equilibrium takes the form 



dt 



= JLy, 



(1.2) 



where y denotes the displacement from equilibrium. The spectral problem ( 1.1 ) arises 
by looking for solutions growing as e vt by making the substitution y(t) — e ut u, and we 
say that (1.2) is spectrally stable if a( JL) consists of only purely imaginary numbers. 
The points v e er(JL) for which Re{^} ^ are called the unstable spectrum^of (1.2). 
The linearized energy associated with the equation (1.2) is simply the quadratic form 
E[u] := (Lu, u), and the fundamental conservation law corresponding to ( |1.2[ ) is that 
dE[y]/dt = on all solutions y = y(t). 

With the use of information on the spectrum a(L) C R of L our goal is to 
characterize the spectrum cr(JL) C C of JL and in particular to determine (i) the part 
of a(JL) in the open right half of the complex plane (i.e., the unstable spectrum) and 
(ii) the potential for purely imaginary points in a(JL) to collide on the imaginary axis 
under suitable perturbations of the matrices resulting in bifurcations of Hamiltonian- 
Hopf type [S51 [55] in which points of er(JL) leave the imaginary axis and further 
destabilize the system. 

The key idea is to relate the purely imaginary spectrum of to invertibility 
of the selfadjoint (Hermitian) linear pencil £(A) (see Section[2]for a proper definition 
of operator pencils) defined by 



£ = £(A) := L — XK, 



K := (iJ)~ 



K* 



iv € 



(1.3) 



Indeed, upon multiplying (1.1) by — iJ one sees that (1.1) is equivalent to the 
generalized spectral problem 



Lu = XKu . 



(1.4) 



The graphical method [331 1211 EH EZ] is based on the observation that the imaginary 
spectrum of (1.1), or equivalently the real spectrum A of (1.4), can be found by 



studying the spectrum of the eigenvalue pencil £(A) and the way it depends on A G 
i.e., by solving the selfadjoint eigenvalue problem 



£(A)u(A) = (L - XK)u(X) = /i(A)«(A) 



(1.5) 



parametrized by A € K (see Fig. |1.1[ ). Clearly, v = —iX is a purely imaginary number 
in a(JL) if and only if A £ K is a value for which £(A) has a nontrivial kernel, that 
is, € cr(£(A)). It is also easy to see from the right-hand plot in Fig. |1.1| that 
the particular values A for which £ cr(£(A)) correspond in a one-to-one fashion to 



On notation: wc use V to denote the complex conjugate of a complex number v, while A* 
denotes the conjugate transpose of A, or more generally when an inner product is understood, the 
adjoint operator of A. We use (u,v) to denote an inner product on vectors u and v, linear in u and 
conjugate-linear in v. 

2 Although the points v with Rc{i/} < correspond to decaying solutions of i | I .L'| . due to the 
basic Hamiltonian symmetry of (JTTTj) they always go hand-in-hand with points —V in the right half- 
plane that are associated with exponentially growing solutions of jl.2) , explaining why they are also 
included in the unstable spectrum. 
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Fig. 1.1. The key to the graphical method lies in the correspondence of purely imaginary points 
v in the spectrum of JL with zeros of spectrum of the pencil C(X) = L — \K, where K := (iJ)~ 1 . 



intercepts of curves fi = (J>(\) € <r(£(A)) with the axis \i = 0. This identification also 



holds true for spectrum of higher multiplicity (for details see Theorem 3.1). 

To make a clear distinction between the two types of spectrum that are related by 
the above simple argument we now introduce the following terminology. The points 
A in ia(JL) will be called characteristic values of C with corresponding invariant 
root spaces spanned by generalized characteristic vectors or root vectors. Therefore 
a purely imaginary point in o~{JL) corresponds to a real characteristic value of £, 



but the characteristic values of C are generally complex (see Definition 2.1 1. On 
the other hand, given an arbitrary real number A, the points // in o~(£(\)) will be 
called eigenvalues of C(X) with corresponding eigenspaces spanned by (genuine, due 
to selfadjointness as A € R) eigenvectors. When we consider how fi — /i(A) depends 
on A £ R we will frequently call n(X) an eigenvalue branch. 

1.2. Use of the graphical method to find unstable spectrum. Purely 
imaginary points in a(JL) correspond in a one-to-one fashion to real characteristic 
values of C simply by rotation of the complex plane by 90° . The most obvious advan- 
tage of the graphical method is that the presence and location of real characteristic 
values of £ can be easily read off from plots of the eigenvalue branches ft(X) of C(X) 
using the following elementary observations. Selfadjointness of L and skewadjointness 
of J imply that o~(JL) consists of pairs (^, — V), and hence the characteristic values of 
L come in complex-conjugate pairs. Indeed, if u 6 C 2 " is a (right) root vector of JL 
corresponding to v € <j(JL), then (J _1 w)* is a left root vector of JL corresponding 
to —V G a{JL). Therefore the JL-invariant subspace spanned by root spaces corre- 
sponding to non-real characteristic values of C is even-dimensional. Since the base 
space X = C 2n is also even-dimensional, the number of real characteristic values of C 
(counting multiplicities) is even, and consequently the total number of intercepts of 
eigenvalue branches /x(A) with /x = (again, counting multiplicities) must be even. 



If furthermore J and L are real matrices (see Definition 5.4 for a more general 



definition of reality in less obvious contexts) then cr(JL) also consists of pairs (v, v) and 



we say that ( 1.1 ) has full Hamiltonian symmetry. In such a case, er(£(A)) is necessarily 
symmetric with respect to the vertical (A = 0) axis, so the plots of eigenvalue branches 
fi as functions of A are left-right symmetric. This is the case illustrated in Fig. |1.1| 

Finally, the dimension of the problem limits the maximal number of intercepts of 
all branches /i(A) with the axis pi = to In. This is also true for intercepts with every 
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horizontal axis /i = /io, as one can consider the problem J(L — /j,qI)u = vu. With this 
basic approach in hand, we now turn to several elementary examples. 

< Example 1. Definite Hamiltonians. It is well-known that if L is a definite (positive 
or negative) matrix, then a(JL) is purely imaginary and nonzero. Indeed, if u G C 2 " 
is a root vector of JL corresponding to v G er(JL), then Lit = i/J~ 1 u, so taking the 
Euclidean inner product with u gives 

7^ (Lu, u) = v(J~ l u, u), 

and hence neither v nor (J~ x u, u) can be zero. Moreover, (Lu, »)eR and (J~ l u, u) G 
iR, and hence v = (Lu,u) / (J~ 1 u,u) is a purely imaginary nonzero number. 

This simple fact can also be deduced from a plot of the eigenvalue branches u(X) 
of the selfadjoint pencil C(X). Let us assume without loss of generality that L is 
positive definite. We need just three facts: 

• The 2n eigenvalue branches fi(X) may be taken to be (we say this only because 
ambiguity can arise in defining the branches if C(X) has a nonsimple, but 
necessarily semisimple, eigenvalue for some Ael) continuous functions of A. 
In fact, they may be chosen to be analytic functions of A, although we do not 
require smoothness of any sort here. 

• The 2n eigenvalue branches u(X) are all positive at A = since £(0) = L. 

• By simple perturbation theory, <r(£(X)) = — Xa(K ) + 0(1) as |A| — > oo. Since 
K is Hermitian and invertible, it has m < 2n strictly positive eigenvalues and 
2n — m strictly negative eigenvalues. Hence there are exactly m eigenvalue 
branches ^(X) tending to — oo as A — > +oo, and exactly 2n — m eigenvalue 
branches fi(X) tending to — oo as A -> — oo. 

That all 2n characteristic values of £ are nonzero real numbers, and hence a(JL) C iR 
making the system ( |1.2[ ) spectrally stable, therefore appears as a simple consequence 
of the intermediate value theorem; m branches n(X) necessarily cross /j, = for A > 
and 2n — m branches cross for A < 0. Since m + (2n — m) = 2n exhausts the dimension 
of X = C 2 ™, all characteristic values have been obtained in this way. This approach 
provides the additional information that exactly m of the points in a(JL) are negative 
imaginary numbers. Note that in the case of full Hamiltonian symmetry, m = n. 

To illustrate this phenomenon, the branches /i(A) corresponding to the following 
specific choices of L positive definite and JL having full Hamiltonian symmetry: 
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are plotted in the left-hand panel of Fig. |1.2| t> 

< Example 2. Indefinite Hamiltonians and instability. If L is indefinite, then 
<t(JL) is not automatically confined to the imaginary axis. As an illustration of this 
phenomenon, consider the matrices 
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Fig. 1.2. The four eigenvalue branches /i(A) of C(X) in Examples 1-3. The real (characteristic) 
values A for which eigenvalues ^t(A) of C{X) intersect the horizontal line fi = and that correspond 
to purely imaginary points u = —i\ S cr(JL) are indicated with red dots. 



which again provide full Hamiltonian symmetry, but now L has two positive and two 
negative eigenvalues. The corresponding eigenvalue branches n(X) are plotted against 

Obviously ct(£(A)) is bounded away from zero 



1.2 



A £ K in the middle panel of Fig. 
as A <E K varies, and hence there are no real characteristic values of C, implying that 
<j(JL) n iM. = 0. The full Hamiltonian symmetry in this example implies that a(JL) is 
organized in quadruplets. Thus either <j(JL) consists of two positive and two negative 
real points, or er(JL) consists of a single quadruplet of non-real and non-imaginary 
points. Hence all of the spectrum is unstable and the graphical method easily predicts 
spectral instability in this case. \> 



< Example 3. Stability with indefinite Hamiltonians. If J and L are not both 
real the full Hamiltonian symmetry and hence the left-right symmetry of the union 



of eigenvalue branches /i(A) is broken. In the right-hand panel of Fig. 1.2 one such 
example is shown corresponding to 
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(1.6) 



Here we see one eigenvalue branch /i(A) of C(X) intersecting /i = transversely at four 
nonzero locations. Since dim(X) = 4, this implies a(JL) C iM\{0}, and the graphical 
method predicts spectral stability. Note that L is indefinite in this case showing that 
indefiniteness of L does not imply the existence of unstable spectrum. That is, while 
definiteness of L implies stability, the converse is false. > 



1.3. Use of the graphical method to obtain Krein signatures. These three 
examples have illustrated the use of the graphical method to count purely imaginary 
points in cr(JL), which are encoded as zero intercepts of the eigenvalue curves /i = 
/i(A). We now wish to dispel any impression that the location of the intercepts might 
be the only information contained in plots like those in the right-hand panel of Fig. 
[TTT1 and in Fig. [L2] This requires that we introduce briefly the notion of the Krein 
signature of a purely imaginary point v £ a(JL) n iM. 

Krein signature theory [21 [3j |45l 06] allows one to understand aspects of the 
dynamics of Hamiltonian flows near equilibria (see |58| for a recent review). Equilibria 
that are local extrema of the associated linearized energy are energetically stable, and 
this is the situation described in Example 1 above. However, even in cases when an 
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equilibrium is not a local extremum, the energetic argument can still predict stability 
if the linearized energy is definite on an invariant subspace; the fact is that many 
Hamiltonian systems of physical interest have constants of motion independent of 
the energy (e.g. momentum) and this means that effectively the linearized dynamics 
should be considered on a proper invariant subspace of the linearized phase space 
corresponding to holding the constants of motion fixed. Therefore, in such a situation 
only certain subspaces are relevant for the linearized energy quadratic form, and 
dcfiniteness of the form can be recovered under an appropriate symplectic reduction. 



Of course, for (1.2), the invariant subspaces are simply the root spaces of JL, and for 
the particular case of genuine characteristic vectors u corresponding to v G o~(JL), it 
is easy to see how the definiteness of the linearized energy relates to stability. Indeed, 
if JLu = vu, then for invertible J we have Lu = vJ~ 1 u = ivKu and by taking the 
inner product with u and using selfadjointness of L and K one obtains two identities: 

Re{is}(Ku, u) = and (Lu,u) = — Im{v}(Ku, u), (1-7) 

If v lies in the unstable spectrum, then Re-jV} ^ and the first equation requires 
that (Ku,u) = which from the second equation implies E[u] — (Lu,u) — 0, so 
the linearized energy is indefinite on the subspace. On the other hand, if v is purely 
imaginary, then the first equation is trivially satisfied but the second gives no infor- 
mation about (Lu, u). This calculation suggests that (Lu, u), or equivalently (Ku, u), 
carries nontrivial information when v G a (JL) n iR. We will define^ the Krein signa- 
ture of a simple purely imaginary point v G a(JL) (or equivalently of the purely real 
characteristic value A = iv of C) as follows: 

k(X) := — sign(Ku, u), where A G R, JLu = —iXu, K = (iJ)^ 1 , and ii^O. (1-8) 

For simple v G iR, (Ku, u) is necessarily nonzero^] by the invertibility of J. 

Consider a simple point v — —iX in cr(JL) n iR. If the matrices J and L are 
subjected to sufficiently small admissible perturbations, then (i) v remains purely 
imaginary and simple and (ii) k(X) remains constant; as an integer-valued continuous 
function (of J and L), the only way a Krein signature k(X) can change under per- 
turbation is if v = —iX collides with another nonzero purely imaginary eigenvalue. 
From this point of view, one of the most important properties of the Krein signature 
is that it captures the susceptibility of a point v G cr(JL) fl iW. to Hamiltonian- Hopf 
bifurcation [31[7TJ[7I] (see also MacKay |5S] for a geometric interpretation of the Krein 
signature within this context) in which two simple purely imaginary points of a(JL) 
collide under and leave the imaginary axis. Indeed, for bifurcation to occur, it is nec- 
essary that the colliding points have opposite Krein signatures. In fact, this condition 



3 Unfortunately, it is equally common in the field for the Krein signature to be defined as 
sign(L«, u), and while the latter is more obv iously connected to the linearized energy, our defini- 
tion is essentially equivalent according to | |f .7) and provides a more direct generalization beyond the 
context of linear pencils considered here. 

4 The argument is as follows: since v = —i\ is a simple point of a( JL), the space X can be 
decomposed as X = span{u} © X c where X c is the complementary subspace invariant under JL, and 
JL + i\ is invertible on X c . Therefore, v £ X c can be represented in the form v = {JL + iX)w for 
some w £ X c , and it follows that for all v £ X c we have 

(J"V = (J- 1 u,(JL + i\)w) = -((LJ + i\)J- 1 u,w) = -(J _1 (JL + iA)«,w). 

Since A £ R and JLu = —i\u we see that (J~ 1 u, •) vanishes on X c . But since J is invertible this 
form cannot vanish on all of X and hence it must be definite on X QX C implying that (J~ 1 u, u) ^ 0. 
See |44| for more details. 



GRAPHICAL KREIN SIGNATURE AND EVANS-KREIN FUNCTIONS 



7 



is also sufficient, in the sense that if satisfied there exists an admissible deformation 
of JL that causes the Hamiltonian-Hopf bifurcation to occur. On the other hand, 
imaginary points of a(JL) with the same Krein signatures cannot leave the imaginary 
axis even if they collide under perturbation. 



The definition (1.8) makes the Krein signature re(A) appear as either an algebraic 
quantity (computed via inner products) , or possibly a geometric quantity (measuring 
relative orientation of root spaces with respect to the positive and negative definite 
subspaces of the linearized energy form). We would now like to emphasize a third 
interpretation of the formula, related to the graphical method for linearized Hamil- 
tonians introduced above. Indeed, from the point of view of Krein signature theory, 
the main advantage of the reformulation of ( |1.1[ ) as (1.5 1 is that if vq is a purely 



imaginary simple point in a(JL) corresponding to the intersection of an eigenvalue 
branch //(A) with /x = at the real characteristic value A = Ao = wq of £, then the 
Krein signature k(Ao) turns out to have a simple interpretation as the slope of the 
branch at the intersection point: 



k(A ) = sign 



dX {X °< 



(1.9) 



Therefore, without computing any inner products at all, we can read off the Krein 
signatures of the imaginary spectrum of JL in Examples [T] and [3] above simply from 
looking at the diagrams in the corresponding panels of Fig. |1.2| For Example [T] 
the signatures of the negative characteristic values are k = 1 while those of the 
positive characteristic values are k = — 1, and the only possibility for the system to 
become structurally unstable to Hamiltonian-Hopf bifurcation would be for a pair 
of characteristic values to collide at A = (forced by full Hamiltonian symmetry), 
and this clearly requires L to become indefinite under perturbation. On the other 
hand, Example [3] represents a somewhat more structurally unstable case; in order of 
increasing A the signatures are k = 1,-1,1,-1. Therefore all of the pairs of adjacent 
real characteristic values are susceptible to Hamiltonian-Hopf bifurcation. 



The formula (1.9) yields perhaps the easiest proof that simple real characteristic 
values of the same Krein signature cannot undergo Hamiltonian-Hopf bifurcation even 
should they collide; locally one has two branches with the same direction of slope, and 
the persistent reality of the roots as the branches evolve can be seen as a consequence of 



the Intermediate Value Theorem. To prove (1.9), one differentiates (1.5) with respect 
to A at A = Ao and fx = 0, obtaining the equation (L — XoK)u'(Xq) = (//'(Ao) + K)u. 
Taking the inner product with the characteristic vector u = u(Xq) satisfying (L — 
XqK)u — and using selfadjointness of L — XqK for Ao G K gives 

n'(X ){u,u) = -{Ku,u). (1.10) 



Since (u,u) > 0, the expression ( |l.9[ ) immediately follows from the definition (1.8). 
This shows that not only are the locations of the intercepts of the curves /i = fi(X) 
important, but it is also useful to observe the way the curves cross the fi = axis. 

1.4. Generalizations. Organization of the paper. The notion of Krein 
signature of real characteristic values has become increasingly important in the recent 
literature in the subjects of nonlinear waves, oscillation theory, and integrable systems 
[3 [3 El HOI I2H EH IS1 H31 IIH E3 ■ In many of these applications, the situation is more 
general than the one we have considered so far. One direction in which the theory 
can be usefully generalized is to replace the matrix C — L — XK with a more general 
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Hermitian matrix function of a real variable A resulting in a matrix pencil that is 
generally nonlinear in A. Another desirable generalization is to be able to work in 
infinite-dimensional spaces where the operators involved are, say, differential operators 
as might occur in wave dynamics problems. The basic graphical method described 
above can also be applied in these more general settings. 

The main applications we have in mind are for linearized Hamiltonian systems of 
the sort that arise in spectral stability analysis of nonlinear waves, i.e., linear eigen- 
value problems. However many of the ideas used here can be traced back to a nearly 
disjoint but well-developed body of literature concerning nonlinear eigenvalue pencils 
and matrix polynomials |21l 1231 15 7j . The thought to connect aspects of this theory 
to problems in stability of nonlinear waves appears to have come up quite recently, 
although similar ideas were already used in [39]. In [J2]we review the basic theory of 
matrix and operator pencils, which lays the groundwork for both the generalization 
to nonlinear dependence on the characteristic value and the generalization to infinite- 
dimensional spaces. Then, in ^j3]we give a precise definition of Krein signature along 



the lines of (1.8) and show how also for operator pencils there is a way to deduce 



the Krein signature from the way that an eigenvalue curve passes through ji = 0, 



procedure that is a direct generalization of the alternate formula (1.9). 

In Sj4j we consider the problem of relating Krein signatures to a common tool 
used to detect eigenvalues, the so-called Evans function [TJ [TH1 Unfortunately, 
attempts to deduce the Krein signature of eigenvalues from properties of the Evans 
function itself are easily seen to be inconclusive at best. However, the graphical (or 



perhaps topological) interpretation of the Krein signature as in ( 1.9 ) suggests a simple 
way to modify the traditional definition of the Evans function in such a way that the 
Krein signatures are all captured. This modification is even more striking when one 
realizes that the Evans function itself is based on a (different) topological concept pQ, 
a Mel'nikov-like transversal intersection between stable and unstable manifolds. We 
call this modification of the Evans function the Evans-Krein function, and we describe 
it also in S|4j The main idea is that, while in the linearization of Hamiltonian systems 
the Evans function restricted to the imaginary z^-axis characterizes the product of 
individual algebraic root factors, the Evans-Krein function is able to separate these 
factors with the help of the additional parameter /j,. The use of the Evans-Krein 
function therefore allows these different root factor branches to be traced numerically, 
without significant changes to existing Evans function evaluating codes. 

In <|5]we extend the kind of simple arguments used to determine spectral stability 
in Examples [l]-[3] above to give short and essentially topological proofs of some of the 
well-known index theorems for nonselfadjoint spectral problems that were originally 
proven by very different, algebraic methods pH [34j EH [37l [65] . To keep the exposition 
as simple as possible, we present our new proofs in the finite-dimensional setting. 
The added value of the graphical approach is that it makes the new proofs easy to 
visualize, and hence to remember and generalize. Finally, we conclude in Sj6] with a 
brief discussion of related open problems. 

Our paper features many illustrative examples. Readers trying to understand the 
subject for the first time may find it useful to pay special attention to these. 

2. Matrix and Operator Pencils. 

2.1. Basic terminology and examples. In the literature the terms operator 
pencil or operator family frequently refer to the same type of object: a collection of 
linear operators depending on a complex parameter lying in an open set ScC, that 
is, a map C = C(X) from A G S into a suitable class of linear operators from one 
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Banach space, X, into another, Y. Perhaps the most common type of pencil is a so- 
called polynomial pencil for which £(A) is simply a polynomial in A with coefficients 
that are fixed linear operators. (The sub-case of a linear pencil has already been 
introduced in Section [f .f | ) If X and Y are finite-dimensional Banach spaces, we have 
the special case of a matrix pencil. The coefficients of a polynomial matrix pencil 
C(X) can (by choice of bases of X and Y) be represented as constant matrices of the 
same dimensions. We will only consider the case in which X and Y have the same 
dimension, in which case the coefficient matrices of a polynomial matrix pencil are all 
square. Specializing in a different direction, if X — Y is a (self-dual) Hilbert space, 
a pencil (operator or matrix) is said to be selfadjoint if S = S and £(A) = £(A)*. 
With the choice of an appropriate orthonormal basis, the coefficients of a selfadjoint 
polynomial matrix pencil all become Hermitian matrices. 

An operator pencil consisting of bounded linear operators £(A) on a fixed (possibly 
infinite-dimensional) Banach space X is said to be holomorphic at Ao G S if there is 
a neighborhood D of Ao in which C(X) can be expressed as a convergent (in operator 
norm) power series in A (see also |57|, pp. 55-56]). If D contains a real open interval 
I C R (1 S, we say that C is continuously differentiable on / if the restriction of C to 
/ is continuously norm-differentiable. For holomorphic (continuously differentiable) 
matrix pencils, the individual matrix entries of the matrix C(X) are all holomorphic 
functions of A near Ao (continuously differentiable functions on /). 

The theory of operator pencils, and matrix pencils in particular, is well-developed 
in the literature. The review articles by Tisseur and Meerbergen |69j and Mehrmann 
and Voss |BD] are general references that give a numerous applications of operator 
pencils and survey suitable numerical methods for their study. Polynomial pencils 
are particularly well- understood and have an extensive spectral theory J5TJ [23J EZ] • 
While most of the theory is concerned with finite dimensional matrix pencils (called A- 
matrices or gyroscopic systems) some results have been obtained for general operator 
pencils (see [I7J [35J [57] and references therein) . The spectral theory of A-matrices 
under various conditions was developed in detail by Lancaster et al. [3DJ [SOI EH [S3] ; 
see also [5T] for the related subject of perturbation theory of analytic matrix functions. 

In order to better motivate the theory of linear and nonlinear operator pencils, we 
first give some concrete examples of how they arise naturally in several applications. 

< Example 4- Spectral problems in inverse-scattering theory for integrable partial 
differential equations. As is well known, some of the most interesting and important 
nonlinear partial differential equations of mathematical physics including the modified 
focusing nonlinear Schrodinger equation 

^ + ^* + |0| 2 </> + ja(|0| 2 0)* = O, ael (2.1) 

governing the envelope 4>(x,t) of (ultrashort, for a^O) pulses propagating in weakly 
nonlinear and strongly anomalously dispersive optical fibers, and the sine-Gordon 
equation 

Utt ~ u xx + sin(u) = (2.2) 

arising in the analysis of denaturation of DNA molecules and the modeling of super- 
conducting Josephson junctions, are completely integrable systems. One of the key 
implications of complete integrability is the existence of an inverse-scattering trans- 



form for solving the Cauchy initial- value problem for x G R in which (2.1| is given 



the complex- valued Schwartz-class initial condition (p(x, 0) = <fio(x) and (2.2) is given 
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the real-valued Schwartz-clas^] initial conditions u(x, 0) = f(x) and u t (x,0) — g(x), 
and in each case the solution is desired for t > 0. The inverse-scattering transform 
explicitly associates the initial data for each of these equations to a certain auxiliary 
linear equation involving a spectral parameter, the spectrum of which essentially en- 
codes all of the key properties of the solution for t > 0. For the modified focusing 



nonlinear Schrodinger equation (2.1 1 with a^O the auxiliary linear equation for the 
inverse-scattering transform is the so-called WKI spectral problem due to Wadati et 
al. [73) with spectral parameter £ and vector- valued unknown v — v(x): 

Avki(0« = 0, where £wki(0 := £ 2 W 2 + £Wi + W , (2.3) 

involving a quadratic pencil £wki(£), with coefficients being the linear operators 

W 2 := 2ia 3 , Wi := -2ia [ ~ ^° ] , W := a^- - Ua 3 . (2.4) 



J ' u ' da; 2 

Here a 3 is a Pauli spin matri^ In the special case a — 0, one has to use a differ- 
ent auxiliary linear equation known as the Zakharov-Shabat problem [76] which is 
commonly written in the form 

sr-^+U. or (2 - 5) 

which by multiplication on the left by ia 3 is obviously equivalent to a usual eigenvalue 
problem for an operator that is non-self adjoint with respect to the L 2 (K) inner product 
(augmented in the obvious Euclidean way for vector functions v). However, note 
that if one sets A = i£, v(x) = e 17Ta3 ^ 4 w(x), and multiplies through on the left by 
icrie - " ' 3 / 4 , the Zakharov-Shabat problem is also equivalent to the equation 

£ zs (A)u; = 0, where £ ZS (A) := XZ 1 + Z Q (2.6) 

involving a linear pencil £zs(A) with coefficients 

S-^-ft ;V (2.7) 



On the other hand, the initial- value problem for the sine-Gordon equation (2.2 1 is 
solved by means of the Faddeev-Takhtajan problem: 

£ft(0« = 0, Cft :=fFi +F +r 1 J r -i, (2-8) 
a problem for a rational operator pencil with coefficients 



F := - .ga 2 F ±1 := sm(A/)<7i T cos(§/)(7 3 . (2.9) 



5 More generally, one only assumes that sin(/(ir)) is Schwartz class to admit the physically inter- 
esting possibility of nonzero topological charge in the initial data in which the angle / increases by 
a nonzero integer multiple of 2tt as x ranges over R. 

6 The Pauli spin matrices are: 

l\ f0 -i\ (\ 



CT1 :=, 1 <7 2 :=\ i , OB-. _! 



GRAPHICAL KREIN SIGNATURE AND EVANS-KREIN FUNCTIONS 



11 



In all of these cases, the values of £ for which there exists a nontrivial L 2 (M.) solution v 
parametrize the amplitudes and velocities of the soliton components of the solution of 
the corresponding (nonlinear) initial-value problem. The solitons are localized coher- 
ent structures that appear in the long-time limit, and as such it is important to have 
accurate methods to determine the location of any such discrete spectrum. Bounds 
for the discrete spectrum of the WKI spectral problem can be found in 12], and 
very sharp results that under some natural qualitative conditions on the initial data 
confine the discrete spectrum to the imaginary axis for the Zakharov-Shabat problem 
and the unit circle for the Faddeev-Takhtajan problem were found by Klaus and Shaw 
[42] and by Bronski and Johnson [7j respectively. In particular, the techniques used 
in H2] can be interpreted in terms of Krein signatures. Indeed, one of the key 
conditions required in 42"] is that the initial condition 4>q{x) is a real- valued function, 
which makes £zs(-V) a selfadjoint linear pencil. The significance of selfadjointness will 
become clear later. t> 

< Example 5. Hydrodynamic stability. Consider a steady plane-parallel shear flow 
of a fluid in a two-dimensional channel with horizontal coordinate x € K and vertical 
coordinate z € [z\,Z2), and let U — U(z) be the horizontal velocity profile. If the 
stream function of such a flow is perturbed by a Fourier mode of the form ^{z)e lk ^ x ~ ct ^ 
with horizontal wavenumber k e R, then in the case of an inviscid fluid the dynamical 
stability of such a flow is governed by the Rayleigh equation [13] : 

£ R (c)$ = 0, £ r (c) :=cL!+L , (2.10) 

where <E> is subjected to the boundary conditions &(zi) — $(2:2) = 0, and where £r(c) 
is a linear operator pencil with coefficients 

Li:=^-fc 2 , L :=U" -UL X . (2.11) 

For a viscous fluid, the unperturbed flow is characterized by the Reynolds number i?, 
and its stability is determined from the Orr-Sommerfeld equation |13j : 

£os(c)$ = 0, £ OS (c) := cM 1 + M , (2.12) 

where $ is subjected to the "no-slip" boundary conditions $(21) = $(z 2 ) = and 
&(zi) = &(z 2 ) — 0, and where £os( c ) is a linear operator pencil with coefficients 

Mi := ikRLt, M := ikRL + L\. (2.13) 

In both cases, the values of c e C for which there exists a nontrivial solution $ are 
associated with exponential growth rates of —ike, and hence the flow is stable to 
perturbations of wavenumber k if the corresponding values of c are all real. t> 

< Example 6. Traveling wave stability in Klein- Gordon equations. Let V : K — > K be 
a potential function. The Klein-Gordon equation (of which the sine-Gordon equation 



(2.2) is a special case for V{u) := 1 — cos(u)) is 

u tt - u xx + V'(u) = 0. (2.14) 

Travelling wave solutions u(x,t) = U(z), z = x — ct, of phase speed c satisfy the 
Newton-type equation (c 2 — l)U"(z) + V'(U(z)) = 0. The wave is stationary in the 



co-moving frame in which (2.14) is rewritten for u = u(z,t) in the form 



iHt ~ 2cu zt + (c 2 - l)u zz + V'(u) = 0. (2.15) 
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Writing u = U(z) + v(z,t) and linearizing for v small one obtains 

v tt - 2cv zt + (c 2 - l)v zz + V"(U(z))v = 0, (2.16) 

and seeking solutions of the form v(z, t) = 4>(z)e lXt for A £ C one arrives at a spectral 
problem involving a quadratic pencil: 

£kg(A)0 = O, £ KG := X 2 L 2 + \L 1 + L , (2.17) 

where the coefficients are 

L 2 :=l, L x :=2ic^ L Q := (I - c 2 )^ - V" (U(z)). (2.18) 

Note that £kg(A) is an example of a (formally) selfadjoint quadratic operator pencil. 
Spectral stability is deduced |35j if all values of A for which there exists a nontrivial 
solution (j) are purely real. \> 



<\ Example 7. The Rayleigh- Taylor problem. Here we give an example of an operator 
pencil involving partial differential operators and nonlocality (through a divergence- 
free constraint). The Rayleigh- Taylor problem concerns the stability of a stationary 
vertically stratified incompressible viscous fluid of equilibrium density profile p e {z). 
Making a low Mach number approximation, assuming a small perturbation of the 
zero velocity field, and linearizing the Navier-Stokes equations, one is led to consider 
normal mode perturbations of the form e xt u(x, y, z) where the linearized velocity field 
satisfies appropriate no-slip boundary conditions and 

- A 2 p e u + V • (p e u)g + A(Vp - i)Au) = , V-u = 0, (2.19) 

where g is the gravitational acceleration field, r\ > is the viscosity, and p is the pres- 
sure term needed to satisfy the incomprcssibility constraint 28J . A weak reformulation 



on an appropriate divergence-free space allows (2.191 to be cast into the form of an 
equivalent quadratic pencil with parameter A on a Hilbert space. For non-Newtonian 
Maxwell linear viscoelastic fluids, the pencil that arises in the Rayleigh- Taylor prob- 
lem is a cubic polynomial [43] . t> 

The preceding examples all involve polynomial operator pencils (or, like the ra- 
tional pencil appearing in the Faddeev-Takhtajan spectral problem in Example [4j 
that can easily be converted into such). However, it is important to observe that 
spectral problems for non-polynomial operator pencils also occur very frequently in 
applications, especially those involving a mix of discrete symmetries and continuous 
symmetries for which the dispersion relation for linear waves is transcendental. A 
fundamental example is the following. 

< Example 8. Delay differential equations Non-polynomial operator pencils appear 
naturally in systems of differential equations with delays [33J • Consider the system 

x(t) = Ax{t) + Bx(t-r), (2.20) 

for x € C™, where r > is a fixed delay and A,B are complex n x n matrices. To 



study the stability of solutions to ( 2.20 ) with exponential time-dependence of the form 
x(f) = e xt Xo one needs to solve the spectral problem 



-DDE 



(A)x = 0, (2.21) 
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where £dde is the essentially transcendental matrix pencil 

£dde(A) := AI- A-e- TX B. (2.22) 
The existence of values of A e C with Re{A} > for which there exists a nonzero 



solution xq of (2.211 indicates instability of the system (2.20|. \> 



Other applications of operator pencils include the analysis of electric power sys- 
tems [59], least-squares optimization, and vibration analysis [69]. Various examples 
of non-polynomial operator pencils are described in the documentation to the MAT- 
LAB Toolbox NLEVP H], including applications to the design of optical fibers and 
radio-frequency gun cavity analysis. Non-polynomial operator pencils also arise in 
band structure calculations for photonic crystals [14] . another example of a system 
in which discrete symmetry (entering through the lattice structure of the crystal) 
interacts with continuous symmetry (time translation). An application of quadratic 
operator pencils to second-order in time Hamiltonian equations can be found in [S], 
in which a particular case of the generalized "good" Boussinesq equation is studied. 
Finally, note that in an alternative to the approach to linearized Hamiltonians de- 
scribed in j |l.l| the so-called Krein matrix method of Kapitula [3S] relates a general 
linearized Hamiltonian spectral problem to a non-polynomial operator pencil. 

2.2. Spectral Theory of Operator Pencils. In this subsection we summarize 
the theoretical background needed for our analysis. In addition to a proper definition 
of the spectrum (characteristic values) of an operator pencil £ and its relation to the 
spectra cr(£(A)) of the individual operators £(A) making up the pencil £ as A varies, 
our method relies on a kind of continuity of <r(£(A)) with respect to A, which can be 
obtained with appropriate general assumptions. Note that even in the general case 
of operator pencils on infinite dimensional spaces, finite systems of eigenvalues have 
many properties similar to those of eigenvalues of matrices [39] , This allows us to 
study a wide class of problems, although infinite systems of eigenvalues may exhibit 
various types of singular behavior. 

2.2.1. Matrix pencils. Matrix pencils and their perturbation theory are stud- 
ied in [3TJ [521 H21 [Ml [S7] with a particular emphasis on polynomial matrix pencils. 
The finite dimensional setting allows a simple formulation of the spectral problem. 

Definition 2.1 (Characteristic values of matrix pencils). Let £ — £(A) be a 
matrix pencil on X = C N defined for A G S C C. The characteristic values of £ are 
the complex numbers A £ S that satisfy the characteristic equation det(£(A)) = 0. 
The set of all characteristic values is called the spectrum of £ and is denoted c(£). 

While the matrices involved all have size N x N for each A £ S, the characteristic 
equation need not be a polynomial in A and matrix pencils in general can have an 
infinite number of characteristic values. However, characteristic equations of polyno- 
mial matrix pencils of degree p in A are polynomial of degree at most pN in A, with 
equality if and only if the leading coefficient is an invertible matrix. 

On the other hand, the eigenvalues \i of the related eigenvalue problem £(X)u = 
\m satisfy the modified characteristic equation det(£(A)— fjJ) = [Ml H.2.1, p. 63] and 
belong to the A-dependent spectrum er(£(A)) (understood in the usual sense) of the 
£(A). Of course, as /i solves a polynomial equation of degree N in /i, the total algebraic 
multiplicity of the eigenvalues /i is equal to dim(A) = N, independent of A £ S. It is 
well-known that if the matrix pencil £ is holomorphic at Ao € S and if all eigenvalues \i 
of £(Ao) are simple, then the N eigenvalue branches fi = fi(X) are locally holomorphic 
functions of A near Aq. In fact this is an elementary consequence of the Implicit 
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Function Theorem. The possible singularities at exceptional points A (corresponding 
to non-simple eigenvalues fx) are well understood. For linear pencils of the form £(A) = 
Lq + Aii it is worth emphasizing that the individual eigenvalue functions /i = /Lt(A) 
are generally not linear functions of A; moreover, the corresponding eigenprojections 
Pn{\) can have poles as functions of A. See Motzkin and Taussky [6TJ [62] for a study 
of special conditions under which the eigenvalues of linear pencils are indeed linear 
in A and the corresponding eigenprojections are entire functions. Furthermore, mere 
continuity of £(A) does not imply continuity of eigenvectors (see [521 H.1.5, p. 110] for 
specific examples). The situation simplifies for self adjoint holomorphic matrix pencils 
as the following theorem indicates. 

Theorem 2.2 ( 39,, II.6.1, p. 120, Theorem 6.1]). Let £(A) be a selfadjoint 
holomorphic matrix pencil defined for A € S = S with IcS. Then, for A 6 1, the 
eigenvalue functions /i = fi(X) can be chosen to be holomorphic. Moreover, for A 6 M, 
the (orthonormal) eigenvectors can be chosen as holomorphic functions of X. 

Thus even if a fc-fold eigenvalue fj, occurs for some Ao £ R, locally this can be 
viewed as the intersection of k real holomorphic branches fi — /i(A) (and similarly for 
the corresponding eigenvectors) . Note that analyticity is not a necessary condition for 
mere differentiability of eigenvalues /i and eigenvectors for AeK. Indeed, according 
to Rellich [68] , selfadjoint continuously differentiablc matrix pencils have continuously 
diffcrentiable eigenvalue and eigenfunction branches [39l II.6.3, p. 122, Theorem 6.8]. 

2.2.2. Operator pencils. In passing to infinite-dimensional spaces, we want 
to restrict our attention to holomorphic pencils, and to handle unbounded (e.g. dif- 
ferential) operators, we need to generalize the definition given earlier. A convenient 
generalization is the following. 

Definition 2.3 (Holomorphic families of type (A) [39j VII. 2.1, p. 375]). An 
operator pencil £, £(A) : D(X) C X X, defined for X £ S is called a holomorphic 
family of type (A) if the domain D = D(X) is independent of X £ S and if for every 
u £ D, £{X)u is holomorphic as a function of X £ S . 

Note that a holomorphic pencil of densely-defined bounded operators (having by 
definition an operator-norm convergent power series expansion about each Ao £ S) 
is an example of a holomorphic family of type (A). In this context, we present the 
following notion of spectrum of operator pencils (see also [2"31 |4"U1 Wf\ ). 

Definition 2.4 (Spectrum of operator pencils and related notions, [57]). Let £ 
be a holomorphic family of type (A) on a Banach space X with domain D defined for 
X £ S C C. A complex number Xq £ S is called a regular point of £ if £(Ao) has a 
bounded inverse on X. The set of all regular points of £ is called the resolvent set 
p(£) of £. The complement of p(£) in S C C is called the spectrum cr(£) of £. A 
complex number Xq £ S is called a characteristic value of £ if there exists a nonzero 
u £ D, called a characteristic vector corresponding to Xq, satisfying 

£(A )u = 0. (2.23) 

The dimension of the kernel of £(Ao) is called the geometric multiplicity of Xq. A 
sequence of vectors {u^ , , . . . , u^" 1 ^ 1 ^ }, where vP^ is a characteristic vector for Xq, 
is called a chain of root vectors ( or generalized characteristic vectors ) of length m if 

£i£«(A )Hb--*]=o, j = l,...,m-l, £«(A >:= ^fOC(A)u) 

fe=0 ' A=A o 

(2.24) 
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A chain of length m that cannot be extended to a chain of length m + 1 is called a 
maximal chain, and its vectors span a root space for the characteristic value Ao . The 
sum of lengths of all maximal chains corresponding to a characteristic value Ao is 
called the algebraic multiplicity of Ao- A characteristic value is called semi-simple if 
its geometric and algebraic multiplicities are finite and equal. 

The characteristic values of £ are contained in (but need not exhaust) ct(£). Note 
that for holomorphic matrix pencils, the algebraic multiplicity of a characteristic value 
Ao reduces to the order of vanishing of the characteristic determinant det(£(A)) at 
Ao. However, in contrast to a Jordan chain of a non-semi-simple eigenvalue of a 
matrix, the root vectors making up a chain for an operator pencil need not be linearly 
independent, and a generalized characteristic vector can be identically equal to zero. 

We can also consider the spectrum of £(A) as an operator depending paramet- 
rically on A G S, and in particular the eigenvalues /j,(X) of £(A). The fact that 
holomorphic families £(A) of type (A) that have compact resolvent for some Ao G S 
have a compact resolvent for all A € S VII. 2.1, p. 377, Theorem 2.4] can then be 
used to establish the following theorem. 

Theorem 2.5 (Kato, [3l£ VII.3.5, p. 392, Theorem 3.9]) . Let £(A) be a selfadjoint 
holomorphic family of type (A) for A G S, and let Iq C S be a real interval. If £(Ao) 
has a compact resolvent for some A G S then all the eigenvalues of £(A) can be 
represented as a sequence {fj,j(X)} of real holomorphic functions on I Q . Furthermore, 
the complete orthonormal family of corresponding eigenvectors can also be represented 
by a sequence of vector valued holomorphic functions on Iq. 

2.2.3. A special technique for polynomial operator pencils. Let £ be a 

polynomial operator pencil of degree p: £(A) := L + XLi + ■ • • + X p L p , where Lj are 
all operators defined on a common dense domain D within a Banach space X, and 
suppose that L p is invertible. Such a pencil is obviously a holomorphic family of type 
(A). In this case, the problem of characterizing the spectrum <t(£) can be reduced to 
that of finding the spectrum (in the usual sense) of a single operator Cc , called the 
companion matrix 23, 157). acting on the p-fold Cartesian product space X p . Indeed, 
given u G D, define u := (u, Xu, . . . , A p_1 m) t G X p . Then the equation £(A)u = on 
X is equivalent to the standard eigenvalue equation Cxu = Au on X p , where Cc is 
the p x p matrix of operators: 



Cr 



( I 

I 




\—Lq —L\ —Li 



\ 




-L p -iJ 



L k :=L- x L k , 0<k<p-l. (2.25) 



Here I and are respectively the identity and zero operators acting on X. The root 
spaces of the pencil £ are the images of the invariant subspaces of the companion 
matrix Cc under the obvious projection of X p onto the first factor X. 

Since groups of isolated eigenvalues of C'c with finite multiplicities have prop- 
erties similar to those of eigenvalues of finite-dimensional matrices, it is possible to 
completely characterize the chains of the pencil £ in terms of the Jordan chains of Cc ■ 
The constructive algebraic proof given for matrix pencils in |23j can be used without 
modification (see also [57j §12]). 

Theorem 2.6 ([231 p. 250, Proposition 12.4.1]). Let £ be a polynomial operator 
pencil of degree p with an invertible leading coefficient L p , and let Cc be the companion 
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matrix of C. Then {■ 



J ], . . . , ti^'™" 1 !} is a chain of length m for C as defined in 
(2.2$ at a characteristic value Xq if and only if the columns of the p x m matrix 
V = {U 7 U Jq, UJq 1 ) t form a standard Jordan chain for C'c corresponding to 
the same Xq, where Jq is the m x m matrix Jordan blocl^ with eigenvalue Ao and 
U = (uM,...,v,l m -V). 

2.3. Indefinite inner product spaces. Unfortunately, in the case that £ is a 
selfadjoint polynomial operator pencil (with coefficients Lj being selfadjoint operators 
densely defined on a Hilbert space X), selfadjointness is lost in the extension process 
and the companion matrix Cc acting on X p is non-selfadjoint with respect to the 
"Euclidean" inner product on X p induced by that on X: (u,v) := (tti,«i) + ■ • • + 
(u p ,Vp). However, a calculation shows that Cc is indeed selfadjoint with respect to 
an indefinite quadratic form defined by 



(u,v) := (u,B £ v), B c := 



/ Li 
L 2 



j p-\ 



L 2 
L 3 

Lp 




L 



p-i 









0/ 



(2.26) 



Indeed, the Hankel-type operator matrix Be (selfadjoint with respect to the Euclidean 
inner product on X p ) intertwines the companion matrix Cc with its adjoint C* c with 
respect to the Euclidean inner product as follows: BcCc = C* c Bc- This implies that 
the root spaces in X p corresponding to different eigenvalues of Cc (characteristic 
values of C) are orthogonal with respect to the quadratic form (•, •) defined by (2.26). 



The extension technique therefore closely relates the theory of selfadjoint polyno- 
mial operator pencils to that of so-called indefinite inner product spaces or Pontryagin 
spaces. As is apparent from the definition (1.8 1, the Krein signature of a characteris- 



tic value is related to a certain indefinite quadratic form. Although one of the aims 
of our article is the avoidance of the algebra of indefinite inner product spaces, the 
latter are clearly lying just beneath the surface, so we would like to briefly cite some 
of the related literature. The seminal work of Pontryagin 66 opened up a huge field 
devoted to the spectral properties and invariant subspaces of operators in such spaces 
having a wide variety of applications. Important further developments of the theory 
were made in [35] , and general references for many of the key results include [501 [23] • 
Over 40 years after its publication a central result of the theory — the Pontryagin 
Invariant Subspace Theorem — was rediscovered in connection with nonlinear waves 
and eigenvalue counting theorems [11] [2SJ [55] (see [TT] for a historical discussion) . 

3. Graphical Interpretation of Multiplicity and Krein Signature. This 
section contains a survey of known results connecting the characteristic value problem 
(2.23) for a selfadjoint operator pencil £ to the family, parametrized by A € R, of 



selfadjoint eigenvalue problems 



C(X)u(X) = /*(A)u(A). 



(3d) 



3.1. Multiplicity of characteristic values. We first present a theorem well- 
known in the theory of matrix pencils ([23 §12.5, p. 259], see also [33 EZ] for a similar 
approach in the case of operators) . The key idea here is to relate the linear-algebraic 



7 Jokl '■= ^O^fc-i + <5fc-;+i where &i denotes the Kronecker delta symbol. 
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notions of algebraic and geometric multiplicity of characteristic values and root vector 
chains to information contained in the graphs of eigenvector branches fj, = /j,(X) and 
the corresponding eigenvector branches u = u{\) of the selfadjoint eigenvalue problem 



(3.1 ). To give better insight into this result so fundamental to the approach described 
in this paper, we give its complete proof. A different proof can be found in [571 p. 57, 
Lemma 11.3, and p. 61, Corollary 12.4 and Lemma 12.5], where /i;(A) is called a 
generating polynomial (see also [131 [35] ) . The key property needed to establish the 
result is some degree of smoothness of the real functions /i ~ /j,(X) at their roots; while 
it is not completely necessary, for simplicity we will assume that £ is a selfadjoint 
operator pencil that (i) is a holomorphic family of type (A) and (ii) has compact 
resolvent for some (and hence for all) Ao <G S. Under these conditions we can appeal 
to Theorem |2.5| to guarantee the required smoothness. Our assumptions have the 
added benefit of making C(X) a Fredholm operator for each A £ R C S. However, 
most of what will be formulated below will also hold if only £(A) is Fredholm and the 
branches fi = /x(A) of interest are sufficiently smooth near a real characteristic value 
A . 

Theorem 3.1 (Graphical interpretation of multiplicity). Let C be a selfadjoint 
holomorphic family of type (A) with compact resolvent, and assume that C has an 
isolated real characteristic value Ao . Then the following properties are equivalent: 

(a) Ao has finite algebraic multiplicity a and geometric multiplicity k with corre- 
sponding maximal chains of root vectors (mi + • • • + = a) 

{4 01 ,-..,4 mi " 1] }, {4 01 ,...,4 m2 - 1] }, .... {4 0] ,-.4 mfc - 1] }- (3.2) 

(b) There exist exactly k analytic eigenvalue branches /i = /ii(A), . . . , /i = /Ufe(A), 
vanishing at A = Ao where fJ,j(X) vanishes to order rrij: (j,^ (Ao) = for 

n = 0, 1, ... , rrij — 1, while fij 1 (Ao) =/= 0. The derivatives of the corresponding 
orthonormal analytic eigenvector branches u — Ui(A), . . . ,u — Ufe(A) encode 
the chains of root vectors as follows: 

U * ] = r! S^ (Ao) ' ^M,...,™,-!. (3.3) 

Proof. Note that £(Ao) is a Fredholm operator, so the Fredholm alternative 
applies, and that analyticity of all eigenvalue and eigenvector branches is guaran- 



teed by Theorem 2.5 (The theorem holds true under more general conditions that 



guarantee these properties of C.) We prove the theorem by mathematical induction 
on m = maxi<j<fc mj. 

Let m = 1 (and hence rrij = 1 for j = 1, . . . , k). First suppose that (b) holds, 
which implies that Ker£(Ao) is spanned by the vectors iti(Ao), • • • , Wfc(Ao) and one 
can set u 1 ^ := Uj(Xo), j — 1, . . . , k. It remains to show that each of the single-vector 



chains {w^} is maximal. Our assumptions allow us to differentiate (3.1 ) with respect 



to A along the branches fi = fJ-j(X) and u = Uj(X) and evaluate at A = Ao to obtain 

£'(A )4 0] + £(Ao)^(A ) - ^(Xo)uf (3.4) 
where we have used ^(Aq) = 0. Note that as rrij = 1 we have /i' (Aq) ^ 0. If the 



chain } were not maximal, then by Definition 



2.4 



there would exist a vector u} 



such that C{\ )uf + £'{\ )uf ] = 0. Since £{\ )uJ r = and (uf\uf ] ) = 1, we can 
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combine this with ( |3.4[ ) to obtain 

= (<(A ) - uf ] ,£(X )uf ] ) = (£(A ) [<(A ) - uf ] ] ,uf) = ^(A ) ^ 0, (3.5) 

an obvious contradiction. Therefore each chain {u^} for j = l,...,k, is maximal, 
and we conclude that (a) holds. 

On the other hand, if (a) holds, then Ker£(Ao) has dimension fc, and hence 
there exist k eigenvalue branches fi = fJ-j(X) (counted with multiplicity), all analytic 
functions with roots at A = Ao. To conclude that (b) holds it just remains to prove 
that each Hj(X) vanishes at A = Ao to exactly first order. But were this not the case 
for some j, we would have ^(Aq) = and then (3.4) would imply the existence of a 

generalized root vector Uj := u'j(Xo), in contradiction with the fact guaranteed by 
case (a) with rnj = m = 1 that Ao is semi-simple. The theorem therefore holds in the 
case that m = 1. 

Now suppose that the theorem is true for some m = M > 1. Let m = M + 1 and 
assume that (b) holds. Let /i = /ij(A) be an eigenvalue branch that vanishes at A = Ao 
to order M + 1. We differentiate (3.1) M times with respect to A along fj, = Hj(X) 
and u = Uj(X) at A = Aq to obtain 



71/ 



71=0 



J2 £W(A )uf-")(Ao) = 0, 



(3.6) 



since f/j 1 (Ao) = for n = 0, ... , M. By the inductive hypothesis, a chain of root 
vectors of length M for the characteristic value A is given by 



{uf\ufy,...,uf*-v} = L(x u j (\ ) 



i 



,(M-1) 



(M - 1)1 



(Ao) 



(3.7) 



Using (|3.7[) in (3.6) and comparing with Definition |2.4| shows that the chain can 



be extended to one of length M + 1 by defining u 



u ( i M) (X )/Ml That the 



extended chain is maximal follows from the fact that ^ M+1 \\o) ^ just as in the 
case of m = 1, and therefore (a) holds for m = M + 1. 

Now assume instead that (a) holds for m = M + 1 and that the theorem is true 
for m = M. Then for each j with rrij = M + 1 there exists a generalized root vector 



satisfying (according to Definition 



2.4 



and the inductive hypothesis) 

M 



£ i£(">(A 0) u^ = C{X Q )uf 1 + i £ Hr(»)(A )uf ->(A ) = 0. (3 



The inductive hypothesis implies that fij (Ao) = for n = 0, 1, . . . , M — 1, so in place 
of (3.6) we have 



A I 

E 

71=0 



£(")(A )^ M -" ) (A ) 



(M),, s [0 

A 1 } (Aq)Mj 



Comparing these two equations shows that 

£(A ) (^(Ao)-M!ufl)=^(A ) U ; 01 . 



(3.9) 



(3.10) 
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Thus |uJ M) (Ao)itJ 0] G Ker(£(A )) n Ran(£(A )) and by C(\ )* = £(A ) for A G R 

the Frcdholm alternative implies^] that /i^ M - ) (Ao) = because ^ 0. The proof of 

non- vanishing of /i^ (Ao) using the maximality of the chain is similar to the case 
of m = 1. Therefore (b) holds as well. 

By mathematical induction it therefore follows that the theorem is true for all 
finite to = maxKjXt rrij. □ 

The significance of Theorem |3.1| is that the algebraic and geometric multiplicities 
of isolated real characteristic values of selfadjoint pencils can be simply read off from 
plots of the real eigenvalue branches fi = fij(X) of the selfadjoint eigenvalue problem 



(3.1). Indeed, real characteristic values are simply the locations of the intercepts of 
the branches with fi — 0, the geometric multiplicity of a real characteristic value Ao 
is simply the number of branches crossing at the point (A,/i) = (Ao,0), each branch 
fi = (J-j(X) corresponds to a single maximal chain of generalized root vectors, and the 
length of each maximal chain is simply the order of vanishing of the analytic function 
/Zj(A) at A = Aq. 

3.2. Krein signature of real characteristic values. The Krcin signature of a 
real characteristic value is usually defined in terms of the restriction of an appropriate 
indefinite quadratic form to a root space, with the signature being positive, negative, 
or indefinite according to the type of the restricted form. Our goal is to give a useful 
definition of Krein signatures for real characteristic values of quite general operator 
pencils, however at the beginning we will restrict ourselves to polynomial pencils with 
invertible leading coefficient, as this makes available the companion matrix method 
described in § |2.2.3| In this case the relevant quantities can be defined as follows. 

Definition 3.2 (Krein indices and Krein signature). Let C be a selfadjoint 
polynomial operator pencil of degree p with invertible leading coefficient L p acting in 
a Hilbert space X, and let Ao be an isolated real characteristic value of C Given a 
root space 11 C X spanned by the vectors of a maximal chain {vJ°\ . . . ,iJ m-1 l} for 
Aq , let u^' G X p . j = 0, . . . , to — 1, denote the columns of the matrix V defined in 
the statement of Theorem \2.6\ and let W be the to x to Gram matrix with elements 
Wjk '■= (u^,uW), in terms of the indefinite Hermitian quadratic form (•,•) given 



by (2.261. The number of positive (negative) eigenvalues of the Hermitian matrix 
W is called the positive ( negative ) Krein index of the root spac^ U at Ao and is 
denoted k + (U,X ) (k~(U,Xo)). The sums of k^(U,Xq) over all root spaces U of 
Ao are called the positive and negative Krein indices of Xq and are denoted k; (Ao). 
Finally, k(U,Xq) :— k + (K,Xq) — k~(U,\q) is called the Krein signature of the root 
space 11 for Ao, and «(Ao) := k + (Ao) — k~(Ao) is called the Krein signature of Xq. 

If k + (Ao) > (respectively k~(Ao) > 0) we say that Ao has a non-trivial positive 
(respectively negative) signature, and if both k + (Xq) and k~(Ao) are positive we say 
that Ao has indefinite Krein signature. It is not obvious but truep"] that the Krein 
signature of a single root space k(U, Xq) is always either 1, —1, or regardless of the 
dimension of K. The "total" Krein signature k(Xq) of a real characteristic value Ao can, 
however, take any integer value by this definition. If Ao is a simple real characteristic 
value with characteristic vector u € X, then the only root space 11 is spanned by u, and 

8 If Ao ^ R the argument fails, as the Fredholm alternative only implies that fA (\o)u,j £ 

Ker(£(Ao)) and ^ M \Xq)u^ _L Ker(£(Ao))- Such a nontrivial vector can indeed exist. 

9 The Krein indices re (U, Ao) are in fact well-defined although the construction appears to depend 
on the specific choice of vectors makin g up the chain that spans the space XA. 



10 



It is part of the proof of Theorem 3.4 below. See [43 
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the corresponding (only) column of the matrix V is u = (u, Xqu, . . . , Aq u) t € X p , 
so the corresponding Gram matrix W is a real scalar given by 

W = (u, u) = {Liu + 2X Q L 2 u + ■■■+ pX^LpU, u) = (£'(A Q )u, u) (3.11) 



as a simple calculation using (2.26) shows. If furthermore £ is a linear selfadjoint 



pencil, C(X) = XLi + L , then C'(Xq) = L\ and the Krein signature is given by 



k(Xq) — sign(Liu,w), which coincides with the usual definition (see (1.8)) found 
frequently in the literature. 

This definition mirrors a recent approach |36l [38l EI] to spectral analysis of 
infinite-dimensional linearized Hamiltonian systems in which the important informa- 
tion about a point of the spectrum is obtained from a finite-dimensional reduction 
called a Krein matrix, an analogue of the Gram matrix W. Kollar and Pego [H] 
developed a rigorous perturbation theory of Krein signatures for finite systems of 
eigenvalues in the spirit of Kato |39j , proving results that had previously belonged to 
the mathematical folklore in the field. 

3.3. Graphical Krein signature theory. One of the main messages of our 
paper is that the correct way to generalize the notion of Krein indices and signatures 
so that they apply to and are useful in the analysis of spectral problems involving 
operator pencils of non-polynomial type like that illustrated in Example [8] is to eschew 
linear algebra in favor of analysis of eigenvalue curves /i = fJ-(X) of the problem 



(3.1) in the vicinity of their roots. This approach is attractive even in cases where 
companion matrix methods explained in §2.2.3| suffice to define the relevant quantities. 
We therefore begin by formulating such a "graphical" definition of Krein signatures. 

Definition 3.3 (Graphical Krein indices and signatures). Let C(X) be a self- 
adjoint holomorphic family of type (A) with compact resolvent, and assume that C 
has an isolated real characteristic va lue Xp . Let fi — /i(A) be one of the real analytic 



eigenvalue branches of the problem (3.1 ) with ^ n > (Ao) = for n = 0, 1, . . . , m — 1, 
while // m )(Ao) ^ 0. Let n(fi) := sign(// m ) (Ao)) = ±1. Then the quantities 

K*{H,X ) ~<f. ... (3.12) 

\ ^{m ± rj{ji)) , for m odd 

are called the positive and negative graphical Krein indices of the eigenvalue branch 
[i = /i(A) corresponding to the characteristic value Ao- The sums of Kg (/i, Ao) over 
all eigenvalue branches crossing at (A,/i) = (Ao,0) are called the positive and neg- 
ative graphical Krein indices of Ao and are denoted Kg(Ao). Finally, K g (/i, Ao) := 
Kg (/x, Ao) — Kg (fi, Ao) is called the graphical Krein signature of the eigenvalue branch 
fi = /i(A) vanishing at Xq, and K g (Ao) := Kg"(Ao) — KgT(Ao) is called the graphical Krein 
signature of Xq . 

Note that it follows directly from the definition of the graphical Krein indices 
Kg (/i, Ao) that the graphical Krein signature K g (/i,A) necessarily takes one of the 
three values 1, —1, or 0. The concept of sign characteristics of operator pencils [231 
Chapters 7 and 12] is closely related to our definition; here each root space of C 
corresponding to a characteristic value Ao is associated with the sign of /i(Ao) ^ 0, 
where fi(X) = (A - A ) m " 1 /i(A) near A = A . See also (23j Theorem 5.11.2]. The 
relationship between sign characteristics and Krein indices can be expressed by an 
algebraic formula [531 P- 77, equation (5.2.4)]. 

Due to this correspondence, the Krein signature as given by Definition|3.2|and the 



graphical Krein signature as given by Definition 3.3 are well-known to be connected 
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in the theory of matrix pencils (see (1.10) for the relevant calculation in the case of 
a simple characteristic value). The connection is described in detail in [331 P- 260, 
Theorem 12.5.2] (see also [HJ §10.5]). The proof as formulated in these works for 
matrix pencils is essentially the same as for operator pencils. A different proof of a 
more perturbative nature can be found in [361 Lemma 9]. See also |lll Lemma 5.3] 
and [72] Theorem 1] for the same result in the special case a generalized eigenvalue 
problem (linear pencil) . Yet another proof based on the Frobenius rule can be found 
in [H Theorem 4.2]. 

Theorem 3.4 (Graphical nature of Krein indices and signatures). Let C be a 
self-adjoint polynomial operator pencil of degree p with invertible leading coefficient 
L p , and assume also that C is a holomorphic family of type (A) with compact resolvent. 
Let X be an isolated real characteristic value of £. LetlA = spanjiJ ] , vP~\ . . . , ut™ -1 !} 



be a the root space for Xq corresponding ( according to Theorem 3.1) to the analytic 
eigenvalue branch [i = /i(A) of the associated eigenvalue problem (3.1 1. Then 



-~(U,X ) 



(3.13) 



from which it follows that 



k(U,X ) = K g ((l, A Q ) 



(3.14) 



and summing over all root spaces U and corresponding eigenvalue branches fx, 

k (Aq) = (Aq) and k(Aq) = /%(A ). (3.15) 



In particular, this result implies that for simple characteristic values Ao, the Krein 
signature can be calculated easily from the formula (1.9 1. This result allows us to 



extend the notion of Krein signature in a very natural way to problems arising in 
the theory of stability of nonlinear waves that cannot easily be formulated as spectral 
problems for polynomial pencils as the spectral parameter A enters into the problem in 
two different, transcendentally related, ways (as in Example[8]). Finally, the graphical 
Krein signature is the one that is most easily detected by a natural generalization of 
the Evans function, the subject that we take up next in fj4] From now on, when we 
refer to total Krein indices and Krein signatures of a real characteristic value Aq, we 



will always mean the (more widely applicable) graphical quantities of Definition 3.3 
and omit the unnecessary subscript "g" . 

4. Evans Functions. Consider a dynamical system linearized about an equilib- 



rium in the form (compare to ( 1.2 ) ) 



du 
lib 



Au. 



(4.1) 



In the infinite-dimensional context typical in nonlinear wave theory, the linearized 



system (4.1 ) is usually a partial differential equation and u is a vector in some Hilbert 
space X. For waves in 1 + 1 dimensions, X is a space of functions of a real spatial 
variable x, and A can be thought of as a linear differential operator acting in X. The 



key spectral problem in the stability analysis of the equilibrium is (compare to (1.1)) 



iXuo = Auq, 



(4.2) 



as each eigenvector uq — uq(-) € X of (4.2 1 generates a solution of (4.1) in the 



separated form u = u(x,t) — e 1 uq(x). Values of v = —iX G o~(A) having a positive 
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real part imply the existence of exponentially growing modes in (4.1 1 and thus linear 
instability of the equilibrium of the original (nonlinear) dynamical system. 

Although numerical methods to study o~(A) based on spatial discretization or 
some other finite dimensional truncation are easy to implement, they may fail to 
detect the full extent of the unstable spectrum or they may introduce spurious A, 
particularly in the vicinity of any continuous spectrum of A. In |15j-[18j a more robust 
and reliable numerical method was developed based on a new concept now called the 
Evans junction, and this method was successfully applied to study the stability of 
neural impulses. Since its introduction, the Evans function has become a popular tool 
for the detection of stability of various types of waves in many applications including 
fluid mechanics, condensed matter physics, combustion theory, etc. 

4.1. Typical construction and use of Evans functions. We begin with a 
standard definition. 

Definition 4.1 (Evans functions). Let C = C(X) be an operator pencil. An 
analytic function D : f2 C C — > C whose roots in £1 coincide exactly with isolated 



characteristic values X = Xq of the spectral problem (2.23), and that vanishes at each 
such point to precisely the order of the algebraic multiplicity of the characteristic value 
is called an Evans function for L (on n). 

It is common to refer to "the" Evans function for a given spectral problem, and 
this usually implies a particular kind of construction appropriate for problems of the 



special form (4.2) that we will describe briefly below. However the key properties of 
analyticity and vanishing on the discrete spectrum with the correct multiplicity are 
shared by many other functions and as the freedom to choose among them can be 
useful in applications, we prefer to keep the terminology as general as possible. 



Let us now describe the classical construction of "the" Evans function for (4.2 1 



in the case that A is a scalar ordinary differential operator of order k acting in X. To 



begin with, the spectral problem (4.2) is rewritten as a (nonautonomous, in the usual 
case that A has non-constant coefficients) first-order systerrj^j 

dv/dx = B{x,X)v, veX k , (4.3) 

where B(x, X) is a k x k matrix- valued function assumed to take finite limits B±(X) as 



x — > ±oo. We consider (4.3) along with the constant-coefficient "asymptotic systems" 

dv/dx = B ± (X)v. (4.4) 



The first-order system (4.3) has a nonzero solution v(x) decaying as \x\ oo, and 



hence (4.2) has a nontrivial solution u G X, exactly for those values of A for which 



the forward evolution W u °°(x) of the unstable manifold of the zero equilibrium of 



(4.4) for B_(X) has a transversal intersection with the backward evolution W^ r °°(x) 



of the stable manifold of the zero equilibrium of (4.4) for B + (X). To properly define 
the evolutes W~°°(x) and W^~°°(x) for a common value of x <E R requires that 
both asymptotic systems ( |4.4[ ) are hyperbolic with "exponential dichotomy" (that 
is, the eigenvalues of -B±(A) are bounded away from the imaginary axis as A varies 
in the region of interest), and for the existence of isolated characteristic values A 
one usually requires complementarity (in C fe ) of the dimensions of the stable and 



11 There are of course many ways to rewrite a single higher-order linear differential equation as 
a first-order system, and if care is not taken key symmetries of the original equation can be lost in 
the process. This is true even in the case of although for such problems with Hamiltonian 

symmetry Bridges li has shown how some of this structure can be retained. 
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the unstable manifolds at x = ±00. The traditional Evans function D(X) detecting 
transversality of the intersection of the evolutes can then be expressed as a k x k 
Wronskian determinant whose columns include p- < k vectors spanning W~°°(x) 
and p + := k — p_ vectors spanning W^°°{x). For coefficient matrices B(x, A) of trace 
zero, Abel's Theorem implies that D(X) is independent of the value of x £ M. at which 
the Wronskian is computed. For systems with nonzero but smooth trace, any value of 
x6l can be chosen but different values of x lead to different Evans functions -D(A) the 
ratio of any two of which is an analytic non-vanishing function of A. Usually in such 
situations the unimportant dependence of D(X) on x € K is explicitly removed by an 
appropriate exponential normalization |63] . This type of construction has been given 
a topological interpretation by Alexander et al. pQ, who related -D(A) to the Chern 
number of a tangential fibre bundle. Pego and Weinstein [64] used this definition of 
D(X) to establish stability properties of solitary waves in nonlinear systems including 
the Korteweg-deVries, Benjamin-Bona-Mahoney, and Boussinesq equations and also 
pointed out a connection between D(X) and the transmission coefficient associated 
with Jost solutions in scattering theory (see also |75j ) . This type of construction has 
also been extended to problems in multiple dimensions where A is a partial differential 
operator; in these cases special steps must be taken to ensure the analyticity of the 
resulting Evans function. In low dimensions a useful generalization of the Wronskian 
can be constructed using exterior products [53] , and a robust numerical algorithm has 
also been developed [31] for this purpose that uses continuous orthogonalization to 
obtain an analytic Evans function. 

The main reason for insisting upon the key property of analyticity of an Evans 
function is that the presence of characteristic values, as zeros of the analytic function 
-D(A), can be detected from a distance in the complex plane by means of the Argument 
Principle. Indeed, to determine the number of characteristic values (counting alge- 
braic multiplicity) in a two-dimensional simply-connected bounded domain of the 
complex plane, it suffices to evaluate an Evans function only on the one-dimensional 
boundary dil and to compute the winding number of the phase of D(X) as A traverses 
this curve. If the winding number is zero, one knows that there are no characteristic 
values in Q. Otherwise, fi can be split into subregions and the process repeated to 
further confine the discrete spectrum. From the computational point of view this is 
both more robust and far less expensive than a fully two-dimensional search of the 
A-plane for characteristic values. Searching for characteristic values by computing the 
winding number of an Evans function D(X) along closed curves is a particularly useful 
approach to the spectral problem ( 2.23 ) if one knows, say by a variational argument, a 
bound of the form |A| < M for the characteristic values of the problem. However, even 
in cases when such a bound in unavailable, it turns out that if the Krein signature of 
certain characteristic values (usually the real ones) is known or can be computed it is 
sometimes possible to justify numerics and to reduce computational costs significantly 
[44] . This suggests that it would be particularly useful if it were possible to extract 
the Krein signature of a real characteristic value directly from an Evans function. 



4.2. Evans functions and Krein signatures. By definition, an Evans func- 
tion D(X) detects characteristic values of (2.231 but the usual constructions produce 
Evans functions that do not provide or contain any information about the corre- 
sponding root vectors. To better understand the problem, first consider a simple 
finite-dimensional case of the linearized Hamiltonian spectral problem (1.1) and the 
related spectral problem (1.4) for an equivalent linear pencil C(X) — L — XK. In this 
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case the most obvious definition of an Evans function is to set 



£>(A) := det(£(A)) = det(i - XK). 



(4.5) 



Note that as L and K are Hermitian matrices, D is real-valued as a function of A £ R 
but of course it has meaning for complex A as well. It might seem natural to expect 
that the Evans function also encodes the Krein signature of the characteristic values 
in some sense. It has frequently been thought that under some circumstances at least 
the sign of the slope D'(X ) may be correlated with the Krein signature k(A ). To 
explore this idea, we present the following two examples illustrating the graphical 



interpretation of the Krein signature as explained in Definition 3.3 and Theorem 3.4 
and comparing with the slope of D(X) at its roots. 

< Example 9. Let L and J be given by (compare to Example El) 



L 













f° 





2 


°\ 


-1 








, J = 











1 





1 





-2 





















vO 


-1 





0/ 



The four real eigenvalue branches fi = /i(A) of the associated spectral problem (3.1| 



for this example are plotted along with the graph of D(X) as given by (4.5 ) in the left- 
hand panel of Fig. |4.1| The Krein signature of each of the real characteristic values 




Fig. 4.1. The four eigenvalue branches /i = ^t(A) of the linear pencil C(X) for Examples \9\ 
\10\ are plotted with solid curves. The real intercepts of the branches with fi = corresponding to 
real characteristic values of C are indicated with dots. The dashed curve is the graph of the Evans 
function (arbitrary units) D(X) = det(£(A)). 



A = Ao can be easily calculated graphically using Definitio n |3.3| and Theorem 3^ or 
equivalently as all four characteristic values are simple, via ( |1.9[ ). In this example it is 
easy to see that for each of the four real characteristic values, k(Xq) = — sign(D'(Ao)), 
and therefore the sign of the slope -D'(A) of D(X) at its zeros is strongly correlated 
with the Krein signature. [> 

< Example 10. Let L and J be given by (compare to Example [T]) 



L = 



\ 



0.5 


















1 







-1 








, J = 











1 








-1.5 





-1 




















- 2 J 




lo 


-1 





°/ 



This case is illustrated in the right-hand panel of Fig. |4.1| There are again four 
simple real characteristic values of C (here consistent with the fact that L is negative 
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definite), and by using the graphical definition ( |1.9| ) we see that the positive (negative) 
characteristic values each have Krein signature k(A ) = +1 (k(Aq) = —1). By contrast 
with Example[9j in this case it is obvious that the slope of -D(A) at its roots is positively 
correlated with the Krein signature of half of the characteristic values and is negatively 
correlated with the Krein signature of the other half. Hence D'(A) does not detect 
the Krein signature correctly as the Evans function fails to capture the attachment 
of the characteristic values to different eigenvalue branches fi = fi(X). > 

These simple examples illustrate the state of understanding in much of the liter- 
ature: it just is not clear under which circumstances one may expect derivatives of 
an Evans function to encode Krein signatures of characteristic values. 

4.3. Evans-Krein functions. In the general setting of operator pencils we now 
address the question posed in |37| (in the context of linearized Hamiltonians) as to 
whether it is possible to determine Krein signatures from an Evans function. In fact 
we can answer this question in the affirmative by showing how a simple generalization 
of the notion of an Evans function correctly detects the Krein signature in every case. 
Our approach provides a simple way to calculate Krein signatures of real characteristic 
values of selfadjoint operator pencils by an elementary augmentation of virtually any 
existing Evans function evaluation code at almost no additional programming cost. 
The main idea is to build into an Evans function information about the individual 
eigenvalue branches fj, = /i(A) to which the real characteristic values are associated as 
roots. In fact, this simply amounts to bringing in /i as an additional parameter. We 
now introduce such an appropriate generalization of an Evans function, which we call 
an Evans-Krein function. 

Definition 4.2 (Evans-Krein functions). Let C = C(X) be an operator pencil 
acting on a space X, and consider the related pencil M. = M(X; fi) := C(X) — fil 
where I denotes the identity operator on X and where /i is a sufficiently small real 
parameter. An Evans function E(X;fi) (in the sense of Definition \4-l\ ) for the /Lt- 
parametrized pencil M. is called an Evans-Krein function for C. 

Under suitable general conditions on C, an Evans-Krein function is not only an- 
alytic for A G Q C C for each small /i, but also will be analytic in both A and /i on 
a set of the form fl x {\fj,\ < 5} C C 2 for some S > 0. Upon setting /i = we see 
that if E(X; /i) is an Evans-Krein function for £, then D(X) := E(X; 0) is an Evans 



function for C, with all of the coincident properties as given in Definition 4.1 It 



is also clear that if /i = fi(X) is a real eigenvalue branch of the problem (3.1), then 
E(X; n(X)) vanishes identically for all A G C for which the branch is defined and /j,(X) 
is sufficiently small. Moreover, if Ao € K is a real characteristic value of C of geometric 
multiplicity k > 1, and if /Zi(A), . . . , /Lt^ (A) are the corresponding eigenvalue branches 



of (3.1 1 crossing at the point (A, fi) = (Ao, 0), then in a C neighborhood of this point, 



an arbitrary Evans-Krein function can be locally represented in the normal form 

k 

£(A;/i) = <7(A;/i)nG«-^(A)) (4.6) 

3=1 

where C(A; fi) is an analytic function with C(Ao; 0) =/= 0. 

Construction of an Evans-Krein function in practice can be very similar to that 
of an Evans function in the usual sense. For a matrix pencil C one can set 



E(\;n) := det(£(A) - //I), and D(X) := E(X;0). (4.7) 
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This definition is also suitable if C is an operator pencil that is a trace-class pertur- 
bation of the identity, in which case the determinants above are to be understood as 



Fredholm determinants. In the case that the spectral problem (2.231 involves differ 



ential operators and one is led to consider the traditional construction of an Evans 



function (as described in S4.1) via a Wronskian of subspaces of solutions decaying 



at the opposite spatial infinities, exactly the same construction gives an Evans-Krein 
function if the pencil C is merely perturbed into M. , because for sufficiently small /x it 



is easy to guarantee the persistence of hyperbolicity of the asymptotic systems (4.4 1 
as well as the complementarity of the stable and unstable subspace dimensions. 

4.4. Calculation of Krein signatures from an Evans-Krein function. Let 

E(X\ n) be an Evans-Krein function for a selfadjoint operator pencil C for which 
eigenvalue branches fi = /i(A) of the associated problem (3.1) are sufficiently smooth, 
and let A = Ao be an isolated real characteristic value of C. By differentiating the 
identity E(X; fi(X)) = with respect to A along a particular smooth branch /i = /i(A) 
for which fJ-(Xo) = and then evaluating at A = Ao we easily obtain 



S A (A o ;0) + ^(Ao;0)/x'(Ao) = 0. 



(4.8) 



According to the normal form (4.6), the partial derivatives E\(Xq;0) and E^(Xq;0) 



are both nonzero under the assumption that the characteristic value Ao is simple, 
which implies both that the number of eigenvalue branches fi = /i(A) of (3.1 ) crossing 
the /i = axis at A = Ao is exactly k = 1, and also that Ao is a simple root of the 
branch function ^(A) (see ^3] for details). Therefore, in this case (corresponding to a 
simple real characteristic value Aq) we have 



£a(A o ;0) 
^(A ;0) : 



(4.9) 



and then comparing with the graphical formula (1.9) for the Krein signature of such 
a characteristic value we learn that 



k(A ) = sign(/i'(A )) 



-sign 



£a(A o ;0) 
£„(A ;0) 



(4.10) 



Recalling that E(X; 0) is necessarily an Evans function D(X) for £, this formula can 
be re-written in the equivalent form 



k(X ) 



-sign 



^(A o ;0) 



(4.11) 



This formulgpl shows how the Krein signature of a simple real characteristic value Ao 
indeed relates to the derivative of an Evans function; the reason that the relationship 
seems ambiguous as shown in Examples [9}fT0| is that to obtain agreement one must 
take into account an additional factor, namely the sign of -E M (Ao;0) which can be 
different for different characteristic values Aq. 



12 The specific Evans function D(X) in the numerator of course depends on the Evans-Krein func- 
tion E(A; fi) from which it arises upon setting fj, = 0. This has some immediate implications; 
for example if one considers a standard eigenvalue problem Lu = Xu for a selfadjoint operator 
L, upon introducing the selfadjoint pencil C(X) := L — XI and the corresponding /i-perturbation 
M(X; fj,) := L — XI — fil = L — (A + fi)I one sees that E\(Xq; 0) = E m (Aq; 0) for all real character- 
istic values (eigenvalues of L) Ao. Hence the Krein signature of each simple characteristic value is 
necessarily positive, even though by Rolle's Theorem D'(X) = _B^(A;0) will have sign changes. 
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An Evans-Krein function E(X; fj,) can also be used to provide information about 
the positive and negative Krein indices of the eigenvalue branch(es) passing through 
fi = at an isolated non-simple real characteristic value Ao of a selfadjoint operator 
pencil. According to the theory presented in Sj3] (see Definition 3.3), the (graphical) 



Krein indices Kg (/x, Ao) associated with the eigenvalue branch // = fi(X) are known 
in the case of odd algebraic multiplicity (order of vanishing of /i(A) at Ao) to once 
the sign of ^"^(Ao) ^ is known (if to is even there is nothing to calculate). But 
// m )(Ao) can also be expressed in terms of nonzero partial derivatives of an Evans- 
Krein function E(X;n) at the point (A,/i) = (Ao,0). The main idea is to calculate 
sufficiently many derivatives of the identity E(X] n(X)) = at the point A = Ao. By 
repeated application of the chain rule, one finds that 

n=0 x ' ^ 



r-l 



+ E E E / • • • ■ ^) ^f_ p (A;/x(A)) I] M W (A), (4.12) 

p=l n+s=p d\<---<d n j—1 
diH hd„=p 

where n, s, and dj for j — 1, ...,n are positive integers and P r ,n,s{di, ■ ■ ■ , d n ) are 
certain complicated coefficients. Many of the partial derivatives of E can be evaluated 



at A = Aq and /x(Aq) = by using the normal form (4.6). While one can develop a 



general theory, to keep things simple we just present two representative examples. 

<\ Example 11. Real characteristic values of geometric multiplicity k = 1. For a real 
characteristic value Ao of C having geometric multiplicity k = 1 but arbitrary finite 
algebraic multiplicity m > 1, there is a unique eigenvalue branch [i = /ii(A) (analytic 



under suitable assumptions) of the problem (3.1) passing through the point (A, [i) 
(A ,0), and we have /^i(Ao) = Mi(Ao) = • • • = /4 m_1 ^(Ao) = but ^"'-'(Ao) ^ 0. If to 



is even, by definition we have n^{^\, Ao) = to/2, while for to odd we need to calculate 

/i^ m ^(Ao) to determine the Krein indices. But this is easily obtained from an Evans- 
Krein function E(X;/j,) by calculating m derivatives of the identity E(X; Mi (A)) = 



using (4.12) and evaluating at A = Aq. Only two terms survive for A = Aq due to the 



fact that /ii(A) vanishes there to order m. Indeed, 

— (X o :0) + —(X o ;0)fi{ m) (X (l )=i> (4.1.8) 



£(A;mi(A)) 



dX 



A=An 



and since E^Xq; 0) ^ as follows from the fact that k = 1, we obtain 

I ft 171 E 

^^—E^d^^- ^ 

Note that d m E /dX m (Xo; 0) ^ because this is the m-th derivative of an Evans 
function -D(A) at a characteristic value of algebraic multiplicity to. t> 

< Example 12. Semisimple real characteristic values. If Ao is a semisimple real 
characteristic value of a selfadjoint operator pencil C of geometric and algebraic mul- 
tiplicity k > 1, then there are k (analytic under suitable assumptions) eigenvalue 



branches fi — f-ij(X), j = 1, . . . , fc, of the problem (3.1 ) each of which has a simple 
root at A = Aq. To determine the Krein indices it suffices to express /^-(Ao) in terms 
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, k. Now, from the normal form (4.6 1 
C(X„;p)p k with C(A o ;0) ^ so in 



of an Evans-Krein function E(X; fi) for j = 1, 
one can easily see that in this case E(X ;ii) 
particular E^(Xo; 0) = unless k — 1 (and then Ao is simple), implying that in general 
the method of differentiation once along each branch individually and evaluating at 
A = Ao as in the calculation leading to (4.9) will fail. 



However, using the normal form (4.6) and the first-order vanishing of all k of the 



branches crossing at (Aq,0) one can easily show that 



d n+j E 



(A ;0) = 0, n + j<k. 



(4.15) 



If we choose a branch fi = fij(X) and differentiate the identity E(X; /Zj(A)) = exactly 
k times with respect to A at A = Ao, then with r = k in (4.12) the identities (4.15) 



guarantee that all of the terms on the second line vanish identically Therefore the k 
values z — /ij-(Ao) are determined as the roots of the A-th order polynomial equation 



E 



d k E 



dfi n dX k 



-(A ;0)z n = 0. 



(4.16) 



This equation is genuinely k-th order because the normal form (4.6) and our assump- 
tions on fJ.j(X) imply that 



d k E 



(A o ;0)=n!C(A ;0) 



d k ' n ln 



5/x"5A fc -" v " u '" / """ dX k 
Here, 7 P (A) is defined as the coefficient of pP in the product 



(Ao)- 



(4.17) 



n(M-Mi(A))=E7p(A)M P 



(4.18) 



p=0 



Since 7&(A) = 1, the coefficient of z k in (4.16) is /c!C(A o ;0) ^ 0. Likewise, since 
7o(A) = (— 1) /ii(A) ■ • • jUfe(A), the constant term in the polynomial on the left-hand 



side of (4.16) is (— l) fe C(Ao; 0)/z' 1 (Ao) • • • a4(Ao) 7^ 0, which implies that all of the roots 
of (4.16) are nonzero. That all k of the roots are real numbers is less obvious but 



necessarily true. > 

In summary, as was observed in [36l [38] , an Evans function D(A) alone can- 
not explain the origin of Krein indices and signatures of characteristic values. The 
geometrical reason is simple: -D(A) is proportional to the product of all eigenvalue 
branches n(X) of the problem ( |3.1[ ), and no amount of differentiation with respect to 
A will separate the individual branches. However, the branches can indeed be sep- 
arated with the help of derivatives of an Evans-Krein function E(X; /i) with respect 
to [i at (X, fi) = (Ao,0) that encode the local behavior of each branch close to the 
characteristic value. 

4.5. Further generalization of Evans-Krein functions. For some purposes 



(see Example 13 below) it may be important to determine whether two distinct real 
characteristic values of a selfadjoint operator pencil L belong to (are obtained as roots 
of) the "same" eigenvalue branch p = /«(A) of the associated eigenvalue problem (3.1 ). 



Determining whether two real characteristic values are connected in this way requires 
analyzing the eigenvalue branches of the operator C(X) away from fi = 0. When we 



GRAPHICAL KREIN SIGNATURE AND EVANS-KREIN FUNCTIONS 



29 



described a typical construction of an Evans-Krein function in problems of stability 
analysis for nonlinear waves, we pointed out that when fj, is sufficiently small the 



hyperbolicity of the asymptotic systems ( 4.4 ) is guaranteed if it is present for fj, = 



If we wish to have a construction that works for larger values of /i, we can generalize 
the definition of an Evans-Krein function in the following way. 



Instead of (3.1) one can consider the /x-deformed characteristic value problem 

C(X)u = fiSu, (4.19) 

and define a generalized Evans-Krein function as an Evans function for the pencil 
.Ms (A; fi) :— £(A) — [iS, where S is a suitable positive definite selfadjoint operator on 
X. The correspondence between traditional and graphical Krein indices/signatures 
given in Theorem |3.4| can be shown to hold when the eigenvalue branches are defined 



by the modified problem (4.19 ) for a wide class of positive definite selfadjoint operators 
S : X — i X. In the case of spectral analysis of nonlinear wave equations linearized 
about a localized traveling wave solution (for which the Evans function construction 



described in f 4.1 applies), it is enough to define Su(x) = f(x)u(x) where f(x) is a 
positive Schwartz-class function. The rapid decay of / as \x\ — > oo guarantees that for 
all \x however large the asymptotic systems ( |4.4[ ) are exactly the same as they are for 
/i = and hence hyperbolicity is preserved. Such a generalized Evans-Krein function 
should be straightforward to implement in existing computer codes for evaluating 
Evans functions constructed as described in §4.1| Finally, we can observe that the 



freedom of choice of the operator S in (4.19 1 suggests a certain structural stability of 



the problem of determining Krein indices and signatures of real characteristic values 



of the problem (2.231. A specific example of a kind of problem in which one needs to 
consider values of /i that are not small is the following. 

< Example 13. Stability of vortices in axisymmetric Bose-Einstein condensates. The 
wave function 'F = ty(x,t) of a Bosc-Einstcin condensate confined in a harmonic trap 
evolves according to the Gross-Pitaevskii equation 

m t = (-Ia + ^ + ^A V, (4.20) 

where A is the Laplacian. In the two-dimensional case (x € 1R 2 , |x| 2 := x\ + x 2 ,) the 



Gross-Pitaevskii equation (4.20) has solitary wave (vortex) solutions of the form 



^{t, r, 9) = e^ m+1+ ^V m V-M, (4.21) 

where (r, 9) are polar coordinates, m is the degree of the vortex, m + 1 + [i is the 
vortex rotation frequency and w^ m (r) is the radial vortex profile. 

The stability of vortices in Bose-Einstein condensates has been well-studied. For 
example, in [33] an Evans function was used to study the spectral stability of vortices 
of degree m = 1,2 which exist for /i > 0. Reformulating the spectral stability problem 
studied in [44] in the language of selfadjoint operator pencils, spectral instability cor- 
responds to the presence of non-real characteristic values (mode frequencies) A of the 
spectral problem CbecWu = for a function y = (y + (r), y_(r)) T e L 2 (R + , C 2 ; r dr), 
where £bec(A) = ALi + Lq is the linear pencil with coefficients 



2 



L\ := -0-3, 

Lo:= {-hi r i + r 2 - m - 1 -^ +L ^ +2lw ^ m{rn r (4.22) 

j m 1 / m2 

+ -^-°3 + \W^ m {r)\ 0- X . 
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Here, j is an integer satisfying < \j\ < 2m that indexes an angular Fourier mode. 
Note that in the indicated weighted L 2 space, this operator pencil £bec is selfadjoint 
for bounded y that decay rapidly as r —¥ oo. 

In |44j the radial profile of the vortex and then the corresponding characteristic 
values of £bec were calculated numerically for a moderately large range of values of 
fi > 0. Note that /i > enters into the pencil £bec both through the linear term — /il 
and also through the radial profile \w^, m (r)\ 2 of the vortex. 

For to = 1 all the characteristic values of £bec are real and thus the vortex is spec- 
trally stable, while for m = 2 variation of the parameter /i leads to Hamiltonian-Hopf 
bifurcations creating "bubbles" of instability |55j in which two colliding characteristic 
values leave and subsequently return to the real axis. As it is typical in Hamiltonian 
systems, the underlying Hamiltonian has only a few negative eigenvalues limiting the 
number of possible real characteristic values of negative Krein signature. In fact, for 
unstable vortices (to = 2, j = 2) there is only one real characteristic value of negative 
Krein signature that repeatedly collides with the remaining real characteristic values 
of positive signature as the parameter \x increases. In |44j a rigorous theory of con- 
tinuity of Krein signatures for finite systems of characteristic values was developed, 
and its results were used to identify the unique real characteristic value having neg- 
ative Krein signature via a continuation (homotopy) method in \x starting from the 
explicitly solvable case of fi = 0. 

The apparent weakness of this method is that a stability check for large values 
of ji requires a significant computational overhead — the calculation of the charac- 
teristic values of the pencil £bec f° r a discrete sampling of a large interval of \i that 
is sufficiently dense to ensure continuity of characteristic values (and their Krein sig- 
natures). Naturally, once a characteristic value A has been calculated for any value 
of fi it is possible to determine its Krein signature by an evaluation of the particular 



quadratic form as in (1.8) at the corresponding characteristic vector. Unfortunately, 
the Evans function used in [44j was constructed from exterior products, and conse- 
quently it is not obvious how to numerically recover the characteristic vectors from the 
Evans function; this meant that the characteristic vectors and subsequently the Krein 
signature of each characteristic value had to be calculated separately. By contrast, it 
would be easy to capture the Krein signature of any real characteristic value directly 
from an Evans-Krein function with a minimal extra computational and coding cost 
for any value of /z, thus making calculations for large fi directly accessible. t> 

5. Index Theorems for Linearized Hamiltonian Systems. The graphical 
interpretation of Krein indices and signatures afforded by Theorem [3^4] can be applied 
to develop very simple proofs of some well-known index theorems (eigenvalue counts, 
inertia laws) for linearized Hamiltonian systems that appear frequently in stability 
analysis [24 ,26,37,65 where the central question is the presence of unstable spectrum 



of ( 1.1 ) in the right-half complex plane. Our goal is to show that the counts of unstable 
spectrum provided by these index theorems can be interpreted in terms of topological 
indices of planar curves in a quadrant. 

Recall Example [T] in which it was shown that unstable spectrum can only be 
present if L is indefinite. In applications to stability of nonlinear waves L typically 
represents the linearization of a Hamiltonian whose kinetic energy part is a positive- 
definite unbounded differential operator. In such cases L will itself be unbounded in 
the positive direction but may have a finite number of negative eigenvalues that might 
cause instability of the system. In this context, the purpose of an index theorem is 
to bound the number of possible points in the unstable (non-imaginary) part of the 
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spectrum a(JL) of (1.1 1 in terms of the total dimension of negative eigenspaces of L 
and information about the stable (purely imaginary) part of the spectrum. It turns 
out that this information amounts to the number of real characteristic values A of the 
corresponding linear pencil C given by ( |1.3[ ) having certain types of Krein signatures. 
(Recall that the real characteristic values of C are in one-to-one correspondence with 
the purely imaginary points v of a(JL) simply by a rotation of the complex plane: 
A = if.) Such theorems therefore generalize the simple statement that positivity of L 
prevents instability. An important application of index theorems is the justification of 
a numerical search for unstable spectrum; once the number of complex characteristic 
values of C obtained numerically (say via Argument Principle calculations involving 
an appropriate Evans function) equals the maximum number admitted by the index 
theorem, the numerical search can be considered complete |44j . 

For the sake of clarity, we will work exclusively in the finite-dimensional setting. 



All of our applications below will be based on Theorem 5.2 and while some aspects of 
its proof are not difficult to generalize to the infinite-dimensional setting (for instance, 
by counting codimensions rather than dimensions), the possible presence of an infinite 
number of discrete eigenvalue branches complicates other aspects of the proof. This 
subject will be addressed in a separate paper. 

To explain the index theorems graphically, we shift the focus from the local in- 



formation contained in the way the eigenvalue branches n(X) of (1.5) cross fx = to 



the global topological information stored in the branches for not small. 

5.1. Graphical analysis of selfadjoint polynomial matrix pencils. We 

begin by formulating some simple consequences of graphical analysis of eigenvalue 
curves /i = /i(A) corresponding to selfadjoint polynomial matrix pencils of arbitrary 



(odd) degree. In {5.2 below we will apply these results in the special case of linear 
pencils, but it is easy to envision future applications where greater generality could 
be useful. We use the following notation below. 

Definition 5.1. If L is a Hermitian NxN matrix, the number of strictly positive 
( respectively negative ) eigenvalues of L counted with multiplicity i s denoted N + (L) 
( respectively N_ (L) ). 

If C is a selfadjoint NxN matrix pencil, the number of strictly positive ( respec- 
tively negative ) real characteristic values of C counted with geometric multiplicity is 
denoted N + (C) (respectively N_(C)). The number of eigenvalue curves [i = /i(A) of 
C passing through (A, fi) = (0,0) with /J.(X) < for sufficiently small positive (re- 
spectively negative) X, will be denoted Z^_(C) (respectively Z^(C)), and analogously 
the number of eigenvalue curves /i = /i(A) for which /i(A) > for sufficiently small 
positive (respectively negative) A will be denoted Z+(£) (respectively z"L{C)). 

Note that whenever we refer to a number of eigenvalue curves as in Definition |5.1| 
we intend them to be counted weighted by their geometric multiplicity. It is important 
to stress that there may be curves counted in both Z+(£) and Z^(£), namely those 
vanishing at A = to even order m with ^( m )(0) < 0. If the order of vanishing m is 
even but fi^ m \0) > 0, then the corresponding curve will be counted in neither Z\(C) 
nor Z^(C). Curves vanishing at A = to odd order will be counted in exactly one of 
Z l + {C) or zi(C). 

Of course N + (L) + N_(L) + dim(Ker(L)) = N, and also z£(£) + zl(C) = 
Z^{C)+Z\{C) = dim(Ker(£(0))). In general neither N+(£) nor iV_(£) is necessarily 
finite, although the situation is better for polynomial pencils as we now recall. A 
polynomial matrix pencil C = C(\) := Lq + XLi + ■ ■ • + X p L p acting in X = C N 
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equipped (as will be understood throughout this section) with the Euclidean inner 
product is selfadjoint if and only if the coefficients L , . . . ,L p are all represented in a 
fixed basis of X as Hermitian N x N matrices. According to Definition |2.1| the total 
algebraic multiplicity of all characteristic values making up the spectrum cr(C) of such 
a pencil is equal to pN under the additional assumption that the leading coefficient 
L p is an invertible matrix. We have the following result. 

Theorem 5.2. Let C — £(A) := Lq + XL\ + ■ • • + X p L p be a selfadjoint matrix 
pencil of odd degree p = 11 + 1 acting on X = C N , and suppose that L p is invertible. 
Then we have the fundamental graphical conservation law 

N - 2N_(L ) - Z±(£) - Zi(£) + ^ k(X) - K ( A ) = °- ( 5 - 1 ) 

AGct(£) Xea(C) 
A>0 A<0 



Also, the following inequalities hold true: 

N ± (C)> N_(L ) + Zi(£) - N T (L p ) 



(5.2) 



Proof. According to Theorem 2.2 the TV eigenvalue curves /i = //(A) associated 
with the selfadjoint pencil C may be taken as holomorphic functions of A € K. The 
assumption that L p is invertible implies that none of the eigenvalue curves can be a 
constant function, and hence each curve has a well-defined finite order of vanishing at 
each corresponding real characteristic value of C Let Q± denote the open quadrants 
of the (A, /i)-plane corresponding to fi < and ±A > 0. By analyticity and hence 
continuity of the eigenvalue branches /i = /u(A), the number of branches entering Q± 
necessarily equals the number of branches leaving Q± (with increasing A). 

Consider first the quadrant <3_. Curves fj. — /x(A) can enter Q_ with increas- 
ing A in only two ways: from A = — oo (corresponding to curves with fJ-(X) < 
asymptotically as A — > — oo) and through the negative A-axis. Since the odd-degree 
polynomial pencil C is dominated by its invertible leading term X p L p as |A| — > oo 
(as is easy to see by applying Rouche's Theorem to the characteristic polynomial of 
the matrix A~ P £(A)) , the number of curves entering Q_ from A = — oo is precisely 
N + (L p ). Similarly, curves fi = fJ,(X) can exit Q- with increasing A in exactly three 
ways: through the negative /it-axis, through the origin (A, /i) = (0, 0), and through the 
negative A-axis. Since C(0) — Lq, the number of curves exiting Q_ through /i < is 
precisely N^(Lq), and Z^(C) is the count of curves exiting Q- through the origin. 
Therefore "conservation of curves" for the quadrant Q_ reads 

N+(Lp) - N-(Lo) - Z±(C) 

= Net number of curves exiting Q- through A<0,yu = 0. (5.3) 

If Ao is a real characteristic value of C, then according to Definition |3.3| and The- 
orem 3.4 the Krein signature h{Xq) exactly counts the net number of eigenvalue 
branches pi = /x(A) vanishing at Ao exiting the half-plane fi < with increasing A. 
Therefore conservation of curves for Q_ can be rewritten equivalently as 

N + (L p ) - N_(L ) - zi(C) = Yl ( 5 ' 4 ) 

AGo-(£),A<0 
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A conservation law for curves in Q + is obtained similarly: 



N-(L P ) - N-(Lo) - Zi(C) 

= Net number of curves entering Q + through A > 0, /i = 

= - K ( A )- (5 ' 5) 



\£<t(C),\>0 



Adding together (5.4 1 and (5.5 1, and taking into account the fact that N_(L p ) + 
N + (L p ) — N because L p is Hcrmitian Nx N and invert ible, we arrive at the graphical 
conservation law (5.1|. 



The inequalities (5.2) follow immediately from the equations (5.3) and (5.5) by 



noting that the absolute value of the net number of curves crossing the positive or 
negative A-semiaxis is a lower bound for the number of curves crossing the same 
axis, and the latter is exactly a count of real characteristic values according to their 
geometric multiplicity. □ 



The key ideas of the proof are illustrated in Figure 5.1 As the proof is essentially 



\\ / \\ , 
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Fig. 5.1. The proof of Theorem \5.3\ is essentially graphical in nature. In each quadrant Q± 
the number of curves entering and exiting through infinity, the negative fi-axis, and through the 
origin must be balanced by the net numbers exiting through the positive and negative X-semiaxes as 
calculated by summing Krein signatures. The negative eigenvalues of Lq are plotted with blue dots. 



topological in nature, the analyticity of the branches implied by Theorem 2.2 could 
be replaced by mere continuity. 

5.2. Application to linearized Hamiltonian systems. We now return to 



(1.1), the spectral problem JLu = vu for a linearized Hamiltonian system, where L 



and J are (Hcrmitian and invertible skew-Hermitian, respectively) matrices of size 
N x N, and we recall that in the finite-dimensional setting the underlying space 
X = C N necessarily has even dimension N = 2n due to invertibility of the skew- 
Hermitian matrix J. Let the total algebraic multiplicity of all points v £ a(JL) with 
Re-jV} ^ be denoted 2n u (necessarily an even number due to the basic Hamiltonian 
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symmetry v — > —v). This is the dimension of the unstable invariant subspace of JL. 
The total algebraic multiplicity of all purely imaginary points of cr(JL) is therefore 
In — 2n u — 2n s , so 2n s is the dimension of the (spectrally) stable invariant subspace 
of JL. The first result is the following. 

Theorem 5.3 (Index theorem for linearized Hamiltonians). Let L and J be 
2n x 2n matrices, and let L be Hermitian and J be invertible and skew-Hermitian. 
Let L = £(A) := L — XK , K := (iJ)^ 1 , be the associated linear matrix pencil. Then 

n u = N_(L)-(- J2 E K ~( A )' ( 5 ' 6 ) 

Ae<r(£) : A>0 \£<t(C),\<0 

The Krein indices K (A) > are associated with real characteristic values A of the 
selfadjoint pencil C, which in turn correspond via rotation of the complex plane to 
purely imaginary points in er(JL). Here the quantity £ is given by 

C = \ dim(gKer(JL)) - \ (z|(£) + zi(C)) , (5.7) 

or equivalently by 

C := - (dim(gKer(JL)) - 2 dim(Ker(i))) + i (z\{L) + &_{£.)) . (5.8) 



Proof. We apply Theorem 5.2 to the selfadjoint linear pencil C associated with 
L and J. Here p = 1, and Lq = L while L\ = —K. Note that K is invertible. With 
N = 2n — 2n u + 2n si the graphical conservation law (5.1 ) reads (upon rearrangement 
and division by 2) 



1 



n u = N_(L) + -(ZX(£) + Zt(£)) 



(5.9) 



A>0 



XGcr(C) 
A<0 



Of course by definition the Krein signatures k(A) appearing on the right-hand side 
can be written as differences of two positive Krein indices: 



k(X) = k + (X)-k-(X). 



(5.10) 



But n s can also be written in terms of the Krein indices of the real characteristic values 
of C Indeed, the total algebraic multiplicity of a purely imaginary point v G cr(JL) is 
equal to that of the corresponding real characteristic value A = iv of C, and the latter 
can be expressed as the sum of the Krein indices k + (X) + n~ (A). Therefore, summing 
over all real characteristic values of C we obtain: 



2n s 



(« + (A) + K -(A)). 



(5.11) 



Substituting this identity and (5.10) into ( |5.9[ ) , and noting that as a count of algebraic 
multiplicity we have k + (0)+k~(0) = dim(gKer(JL)) if € cr(JL), we obtain ( |5~6| ) with 
C given in the form (5J}. Since Z|(£)+Z|(£) = zl(C) + zi(C) = dim(Ker(£(0))) = 
dim(Ker(L)) the alternative formula (5.8) follows immediately. □ 

Recall that if J and L are real matrices, then in addition to the Hamiltonian 
symmetry a(JL) = —a(JL) we have the real symmetry a(JL) = a(JL), in which case 
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system is said to have full Hamiltonian symmetry. In many applications great utility 
is gleaned from this extra symmetry, but one should be aware that the reality of the 
matrices J and L (or operators in the infinite-dimensional setting) is essentially tied 
to a choice of basis or representation. In other words, within the usual Hilbert space 
axioms one finds no inherent notion of what it means for a vector or operator to be 
"real" . Therefore, a problem can possess full Hamiltonian symmetry even though it 
may not be obvious that the operators are "real" due to poor choice of coordinates. 
A useful general notion of reality is based instead on the existence of an appropriate 
antiholomorphic involution on X and can be formulated as follows (compare with 29 , 
Assumption 2.15]). 

Definition 5.4. Let I : X — > X be an involution on a Hilbert space X that is 
unitary: ||/(w)|| = ||u||, Vw € X, and conjugate linear: I(au + (3v) — al(u) + f3I(v), 
Vzt, i)£l and Va, (3 G C. A linear operator A : X — > X is said to be real (with respect 
to I) if it commutes with I: AI(u) = I(Au) for all u € X . Similarly, a vector u G X 
is real (with respect to I) if u = I(u). 

The involution / is simply an abstraction of complex conjugation as might be 
applied to complex coordinates of vectors in A. In the case of real matrices J and L 
we have the following important corollary of Theorem |5.3| 

Corollary 5.5 (Index theorem for linearized Hamiltonians with full Hamilto- 
nian symmetry) . Let L and J be 2n x 2n matrices, and let L be Hermitian and J be 
invertible and skew- Hermitian. Suppose also that L and J are real with respect to a 
unitary antiholomorphic involution I : C 2n — > C 2 ". Then with C, being given by either 



(5.7) or (5 



iV_(£)-C-2 * + (A)=A_(L)-C-2 ]T «-(A), (5.12) 

Aeer(£),A>0 Aeer(£),A<0 



and moreover, £ can be simplified as follows: 

C= idim(gKcr(JL))-Z-(£) 



- (dim(gKer(J£)) - 2 dim(Kcr(L))) + Z + (£), 



(5.13) 



where Z + (C) (respectively Z~~(C)) is the number of curves [i = fJ-(X) vanishing at 
A = whose first nonzero derivative ^^ m -'(A) is positive (respectively negative). If in 
addition one has dim(gKcr( JL)) = 2 dim(Ker(L)), then £ = Z + (C). 

Proof. The fact that JL commutes with the involution / forces the whole system 
of curves /i = /i(A) to be symmetric with respect to reflection through the vertical 
/u-axis. This implies that if Ao > is a real characteristic value (with root space U) 
of the associated linear pencil £, then so is — Aq (with root space I(U)), and from the 



definition of graphical Krein indices (see Definition 3.3 and Theorem 3.4) we see that 



: (Aq) = k =f (— Aq) due to the symmetry of the curves. Therefore, both sums on the 



right-hand side of (5.6) are equal, and (5.12) follows immediately. 



The simplified formulae (5.13) for £ follow from the left-right symmetry of the 
union of curves /x = /i(A) passing through the origin (A,//) = (0,0). Indeed, the 
symmetry implies that the number of curves vanishing to odd order m is even and 
divides equally into the number of those curves for which fi^ (0) > and that of 
their reflections through the /i-axis. This implies that Z l + {£) + zi(C) = 2Z~ , and 
then since dim(Ker(L)) = dim(Ker(JL)) is the total number of analytic branches 
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through the origin, the proof of (5.13) is complete. The fact that the condition 
dim(gKer(JL)) = 2 dim(Ker(X)) implies that £ = Z + (C) is obvious. □ 

The next application concerns spectral problems for linearized Hamiltonian sys- 
tems for which L and J have so-called canonical form, that is, the 2n x 2n matrices 
can be written in terms ofnxn blocks as follows: 



J 











L 











(5.14) 



where I is the n x n identity and L± are n x n Hermitian matrices. Note that in this 
case J is automatically invertible. This form occurs naturally in many applications, 



but in fact the general JL spectral problem ( 1.1 1 can be rewritten in canonical form by 
an appropriate inflation of the dimension of the problem |37j . Under the assumption 
that Kcr(L + ) _L Ker(L_), a lower bound for the number of real points v £ a(JL) can 
be given in terms of the difference of the negative indices of operators M± that are 
suitable projections of the operators L±. 

Theorem 5.6 (Lower bound for real points of cr(JL)). Let L and J be 2n x 2n 
matrices having canonical form (5.14). Suppose also that Ker(L + ) _L Ker(L_), and 
let V denote the orthogonal complement of Ker(L+) © Ker(L_) in Y = C" with 
corresponding orthogonal projection P : Y — > V. If N^( JL) < 2n u denotes the number 
of nonzero real points in cr(JL) counted with geometric multiplicity (necessarily an 
even number by the basic Hamiltonian symmetry o~(JL) = —a(JL) ), then 



1 



Nm{JL) > \N-(M+) - JV_(M_)| 



M+ := PL+P. 



(5.15) 



Proof. First, we reduce the spectral problem JLu = vu on X = C 2 " to a spectral 
problem for a linear selfadjoint pencil on Y using the projection technique [24l l36"l l37l 
[72] , Identifying u € X with the pair (1*1,142) with Uj € Y for j = 1,2, the spectral 
problem JLu = vu takes the form of a coupled system: 

L-U2 = vu\ and — L + u\ = vu<i. (5.16) 

Now since Ker(L + ) _L Ker(L_), the space Y can be decomposed into a direct sum 
of three pairwise orthogonal summands: Y — Ker(L + ) © Ker(L_) © V. Letting 
P± : Y — > Ker(L±) be the orthogonal projections onto Ker(L-t), and using the facts 
that L±P± = and (as L± are selfadjoint) P±L± = 0, we may apply the projections 
P_ and P to the equation L-U2 = vu\ and hence obtain 

= vP-Ui , 

5.17 

PL-P+u-2 + PL-Pu 2 = vPu x , 
and similarly applying the projections P+ and P to the equation —L + ui = vu 2l 

= l>P+Un , 

+ ' (5.18) 
-PL+P-Ux - PL+Pui = vPu 2 ■ 

If v ^ 0, then we obviously have P^ui = P+u 2 = 0, and setting vj — Puj G V gives 
a coupled system of equations on the subspace V: 



M-V2 — vv\ and — M + v\ = W2- 



(5.19) 
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The advantage of this reduction is that M± are both invertible on the subspace V, 
and therefore by eliminating t> 2 we can write the original spectral problem JLu = vu 
for v ^ as a spectral problem C{X)v\ = where C is the linear selfadjoint matrix 
pencil on V given by 



£(A) :=L + \Li, L Q :=M+, L x := MI 1 , A := v 2 . (5.20) 



Now we invoke Theorem 5.2 specifically the inequality (5.2) for the number 
N+(C) of positive real characteristic values of C counted with geometric multiplic- 
ity. Since Lq is invertible as well as the leading coefficient L\, and since the total 
number of curves /i = fJ>(\) associated with the pencil C passing through the ori- 
gin (A,/z) = (0,0) is equal to dim(Ker(L )), there are no such curves at all in this 
application, thus Z±(C) = 0. Therefore (5.2) takes the simple form 



N+{£) > \N-(L ) - = \N_(M + ) - ^(AfZ 1 )) = \N-{M+) - iV_(M_)| . 

(5.21) 



The inequality (5.15 1 follows from the fact that N+(C) counts the number of positive 
points in cr(£), and hence by A = v 2 it is exactly half the number of real nonzero 
points in o~(JL), counted with geometric multiplicity. □ 



5.3. Historical remarks. Corollary |5.5| is a generalization of an index theorem 
proved independently by Kapitula et al. [37] and Pelinovsky [55J in the special case 
that each maximal chain of root vectors corresponding to the characteristic value A = 
has length 2. Such a situation corresponds to many typical Hamiltonian systems with 
symmetries [23 EZ]- m this case £ can be written in the form N-(D) where D is a 
Gram-type Hermitian matrix with elements Djk := (Lu^\u^) and the vectors u^' 
span gKer( JL) Q Ker(Z). 

The relation between the dimension 2n u of the unstable invariant subspace of JL 
and the number of negative eigenvalues of L was first studied for N—(L) = 1, in which 



case under the assumption that dim(gKer( JL)) — 2 dim(Ker(L)), the count (5.12) 
indicates that JL has at most one pair of non- imaginary (unstable) points of spectrum, 
and by the full Hamiltonian symmetry these points are necessarily real. In this case 
the famous Vakhitov-Kolkolov criterion [70] applies in a large class of problems in the 
stability theory of waves and identifies the scalar quantity £ = Z + (C) = N^(D) that 
determines whether n u — or n u = 1, i.e., whether a given wave is spectrally stable. 
In the Vakhitov-Kolokolov theory, the quantity N_(D) turns out to be equal to the 
derivative of the momentum (impulse functional) of the wave profile with respect to 
the wave speed. Pego and Weinstein [M] proved that the same quantity is related to 
the second derivative D"(0) of the (usual) Evans function D(X), which by symmetries 
of the system satisfies D(0) = D'(0) = 0. In celebrated papers, Grillakis et al. 
[2"ol |2"T] extended the spectral stability analysis associated to the Vakhitov-Kolokolov 
instability criterion to establish fully nonlinear stability properties of waves. See [5] 
for the analysis preceding these general results and [38 for a historical discussion. 
Simultaneously, Maddocks [55] developed the theory of inertial laws for constrained 
Hamiltonians and related it to results of MacKay [55] giving rise to an algebraic 
method for proving index theorems. 

Kapitula et al. |37j and Pelinovsky |65j proved a generalization of the Vakhitov- 
Kolokolov criterion to the case of N-(L) > 1. They derived the formula mentioned 
above for £ = Z + (C) in terms of the Gram matrix D, and they also interpreted the 



law ( 5.12 ) as a parity index by writing it in the form n u = N- (L) — N^. [D) (mod 2). 



This identity plays an important role in the presence of full Hamiltonian symmetry 
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because oddness of n n indicates the presence of purely real points of the unstable 
spectrum. Under some technical assumptions, Kapitula [36| reproved these results 
using a similar technique involving continuity of eigenvalue curves in conjunction with 
the so-called Krein matrix method. The latter effectively projects the problem to a 
finite-dimensional negative subspace of L, but in the process introduces unnecessary 
poles in the eigenvalue branches n(X), and these poles obstruct the simple graphical 
visualization of the Krein signatures and indices arising in the analysis. Kapitula and 
Haragus [33] also proved the same count in the setting of differential operators with 
periodic coefficients using the Bloch wave decomposition (Floquet theory) to reduce 
the problem with bands of continuous spectrum to a collection of problems each of 
which has purely discrete spectrum. See also [8] for an alternative proof of (5.12| 



using aspects of integrable systems theory. An analogous index theorem for quadratic 
Hermitian matrix pencils was proved by Pelinovsky and Chugunova [10 using the 
theory of indefinite quadratic forms (Pontryagin spaces) and was later reproved by 
a homotopy technique [43] resembling the graphical method described in this paper. 
Another related count for quadratic operator pencils was recently proved by Bronski 
et al. [9]. Theorem 5.2 is an example of the kind of results that can be obtained 



beyond the simple context of problems reducible to linear pencils. 

When the reality condition guaranteeing full Hamiltonian symmetry is dropped 



and one has instead of (5.12) the more general statement (5.6), it is no longer possible 



to interpret the inertia law as a parity index. Results analogous to those recorded 
in Theorem 5.3 were previously proven in [11] Theorem 6] and in |29[ Theorem 2.13] 
under the assumption that L is an invertible operator with a compact inverse. For the 



reader familiar with 11, 29 , the apparent difference between (5.6 1 and the formulation 
of the index theorems found therein is caused by a different choice of the definition of 



the Krein signature; see the footnote referenced between equations (1.7) and (1.8) 



Theorem 5.6 which concerns problems having the canonical symplectic structure 



(5.14), was proved independently and virtually simultaneously by Jones [34 and Gril- 
lakis [24]. Since the inequality (5.15) only provides a lower bound on the number of 



purely real points of <j(JL) and not its exact count, the straightforward generalization 
from the finite-dimensional setting to operator theory does not require completeness 
of the root vectors. Jones' proof [34 of Theorem 5.6 is of a very different nature from 



the graphical one we have presented, but that of Grillakis [24] is quite similar, with 
spectral projections playing the role of the eigenvalue curves \x = /i(A). We think 
that our approach, embodied in the proof of Theorem |5.2[ gives a very simple way to 
visualize the count. This problem was also studied by Chugunova and Pelinovsky |llj . 
who proved a number of related results using the Pontryagin Invariant Subspace The- 
orem applied to the linear pencil. More recently, a similar approach combined with 
the use of the Krein matrix, an analytical interpretation of the Krein signature, and 
the Keldysh Theorem [30] [57] , was employed by Kapitula and Promislow [35] , who 
reproved Theorem |5.6| (see the paper for further historical remarks). The connection 
to the linear pencil was also pointed out in a similar setting in [72 . 

6. Discussion and Open Problems. We hope that our paper has demon- 
strated how the analytic interpretation of Krein signatures and indices in terms of 
real curves and their order of vanishing at real characteristic values helps to easily 
visualize, simplify, and organize numerous results found in the literature on stability 
of nonlinear waves. 

The analytical or graphical interpretation of the Krein signature put forth in 



Definition |3.3| and Theorem 3.4 is apparently limited to real characteristic values 
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of selfadjoint pencils. In particular, such a formulation applied to the linear pencil 



(1.4) related to the Hamiltonian spectral problem (1.1) does not provide any direct 
information about a(JL) off the imaginary axis. But such a characterization seems 
unnecessary, as the (traditional) Krein signature of non-imaginary points of o~(JL) is 
easy to calculate and is equal to zero. Nevertheless it would be interesting to determine 
whether an approach similar to the one presented here can be applied to study non- 
selfadjoint pencils and/or to detect further information about non-real characteristic 
values of selfadjoint pencils. 

Our main new result, the generalization of the notion of an Evans function to that 
of an Evans-Krein function, simplifies numerical calculation of Krein signatures from 
an Evans function and is a method easy to incorporate into existing codes. From 
the theoretical perspective an interesting problem would be to find an intrinsically 
geometrical interpretation of Evans-Krein functions similar to the characterization of 
Evans functions given by Alexander et al. PQ. 

The graphical nature of the signature should also allow us to generalize the index 
theorems presented in ^5] to handle operators on infinite-dimensional Hilbert spaces 
with general kernels, sidestepping certain unnecessary technical difficulties. One also 
expects to be able to identify optimal (from the graphical point of view) assumptions 
for the validity of these theorems. We advocate that, in general, the concept of the 
graphical signature is often more suitable for analysis (either rigorous or numerical) 
than the traditional one as it does not rely on any particular algebraic structure of 
the operator pencil. This may possibly allow further applications of signature argu- 
ments for non-polynomial pencils where the traditional definition based on indefinite 
quadratic forms falls flat. 

Furthermore, the graphical approach to Krein signature theory may be preferable 
for understanding various mechanisms for avoiding Hamiltonian-Hopf bifurcations 55, 
158] in Hamiltonian systems with a tuning parameter t£R. Of particular interest here 
are "non-generic" collisions (in the sense of Arnold [2]) involving two real characteristic 
values of a selfadjoint pencil having opposite Krein signatures but that nonetheless 
do not bifurcate from the real axis. The question to be addressed is the fate of the 
various eigenvalue curves \x = //(A) of ( 1.5 ) when the spectral problem is perturbed in 
an admissible fashion. Here, one expects that certain conditions may ensure that not 
only do real characteristic values of opposite signatures survive collisions, but also the 
transversal intersection of eigenvalue branches [i = fJ.(X) is preserved as well. Such a 
mechanism for avoiding Hamiltonian-Hopf bifurcations is significantly different from 
the one described by Krein and Ljubarskii [49] in the context of differential equations 
with periodic coefficients. Here we only present an illustrative example of what we 
have in mind; we expect to publish elsewhere new results given necessary and sufficient 
conditions for the preservation of branch crossings. 

< Example 14- Nongeneric perturbations can avoid Hamiltonian-Hopf bifurcations. 

A-fT, K = (iJ) _1 , where L depends linearly on a 
tB where A := diag(5, 1, 3, 2), and where 



Consider the pencil 


£(A) : 


= L 




control parameter t 
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L = 
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(6.1) 



Fig. |6.1| illustrates how transversal intersections between the eigenvalue branches 
/i = /i(A) present for t — can persist for t ^ 0. For general perturbations of 
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L = A one would expect all four transversal intersections to simultaneously "break" 
for t 0, resulting in four uniformly ordered analytic eigenvalue branches /ii(A) < 
A*2(A) < /^(A) < A*4(A), an( i if these branches have critical points, Hamiltonian-Hopf 
bifurcations resulting in the loss of one or more pairs of real characteristic values be- 
comes possible as the branches shift vertically. Therefore the perturbation B has to be 
quite special for the intersections to persist, a situation that prevents all Hamiltonian- 
Hopf bifurcations from occurring. > 




Fig. 6.1. Spectrum of L(t) — XK plotted against X for the matrices in Example \ll\ Left panel: 
t = 0. Center panel: t = I (exhibiting a necessarily harmless collision between characteristic values 
of the same Krein signature). Right panel: t = 4. Between t = 1 and t = 4 two real characteristic 
values of opposite Krein signature pass through each other unscathed due to the preservation of 
transversal crossings of eigenvalue branches fi = n(X) . 

Finally, we would like to stress that mostly for the sake of keeping our presentation 
as simple and accessible as possible, we have chosen to work in finite-dimensional 
spaces (except in Sections [2] and |4|. Extending the theory to operators on infinite- 
dimensional spaces and general self-adjoint operator pencils is a task that has both 
straightforward aspects and also technically difficult aspects. One of the central open 
questions in the field of stability of nonlinear waves is whether it is possible to prove an 
analogue of Corollary [53] in the case of an unbounded operator </, as in the important 
case J = d x relevant to the stability of solutions to the Korteweg-de Vries equation 
[64j . The analysis presented here suggests that once the key assumption of continuity 
of eigenvalue branches of the pencil C(X) is met, the theory developed for bounded 
operators can be generalized to unbounded operators. Our results in this direction 
will be published elsewhere. 
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